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COMPOSITIONS AND METHODS FOR MODULATING DHR96 



I. BACKGROUND 

1 . The control of insects with toxins (pesticides) is one of the largest industries in the 
world. Insects have evolved many methods to deal with pesticides, most of which act through a 
5 xenobiotic detoxification pathway. The regulation of the xenobiotic pathway represents an 

attractive target for pesticides. Disclosed herein, DHR96, a Drosophila gene is shown to regulate 
the xenobiotic pathway, and inhibition of the DHR96 gene expression or activity decreases the 
ability of Drosophila to adapt to toxins, including pesticides, such as DDT. 

O. SUMMARY 

10 2. Disclosed are methods and compositions related to compositions and methods for 

regulating DHR96 and increasing the effect of existing any toxins to control insects are 
disclosed. 

ID. BRIEF DESCRIPTION OF THE DRAWINGS 

3. The accompanying drawings, which are incorporated in and constitute a part of this 
1 5 specification, illustrate several embodiments and together with the description illustrate the 

disclosed compositions and methods. 

4. Figure 1 shows DHR96 is closely related to the PXR/CAR/VDR subfamily of 
xenobiotic receptors. An alignment using the programs PHYLEP and CLUSTALW is depicted 
of the DHR96, DAF-12, PXR, CAR, and NHR-8 nuclear receptors, showing the percent 

20 identical amino acids within either the DNA binding domain (green) or ligand binding domain 
(purple). 

5. Figure 2 shows DHR96 is expressed in organs involved in nutrient absorption, 
metabolism, and excretion. Organs were dissected from wandering third instar larvae, fixed in 
25% formaldehyde and stained with affinity-purified antibodies to detect DHR96 protein. In 

25 wild type larvae, nuclear DHR96 protein is detected in the fat body, in salivary glands and 
regions of the digestive tract including the gastric caece and the Malpighian tubules. Only 
background staining is detected in other tissues, including the imaginal discs and brain. No 
expression was detectable in fat bodies dissected from DHR96 E2S mutant larvae, demonstrating 
the specificity of the antibody stains. 

30 6. Figure 3 shows a strategy for targeted mutagenesis of the DHR96 locus. Al depicts 

the start methionine deletion and A2 depicts the deletion of the fourth exon/intron of DHR96, A 
transgene containing the targeting construct and the GFP marker was circularized by FLP 
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recombinase and subsequently cut with 1-Scel. Homologous pairing between the targeting 
construct and the endogenous DHR96 locus results in the generation of a tandem duplication by 
'ends-in' recombination. To generate a single copy insertion, the tandem duplication was 
reduced by means of homologous recombination by inducing a DNA double stranded break with 
5 I-Oel. 

7. Figure 4 shows DHR96 mutants are more sensitive than wild type flies to the 

5 pesticide DDT. Figure 4A shows a dose response curve. Twenty wild type or DHR96 E2S mutant 
flies were exposed to eight DDT concentrations, from 0.78 to 100 ng/ul, and then scored for 
survival 10 hours later. Figure 4B shows a time course. 20 wild type or DHR96 E2S mutant flies 
10 were treated with a high concentration of DDT (100 ng/ul) and assayed for survival every hour 
up to 10 hours. Each assay (A+B) was done in triplicate to determine the standard deviation as 
shown by the error bars. 

8. Figure 5 shows an alignment of Drosophila nuclear hormone receptor DNA-binding 
domains. An alignment of the DNA-binding domains of known Drosophila nuclear hormone 

15 receptor superfamily members reveals two regions of conserved amino acids flanking a central 
unique region. The conserved amino acids were used to design PCR primers for amplifying 
fragments of Drosophila receptors: F3, F4, F5, R4, R5, R6 and R8. The unique region was used 
to design gene-specific oligonucleotide probes to eliminate previously identified family members 
from further study. 

20 9. Figure 6 shows alignments of DNA-binding domain sequences. The DNA-binding 

domain sequence of each gene was used to search the PIR/Swiss Prot/GenBank databases. An 
alignment of each sequence with representative matches from the databases is presented. Shaded 
boxes indicate identity with the hew protein sequence, and the percent identity is shown to the 
right of each sequence. 

25 1 0. Figure 7 shows temporal profiles of DHR38, DHR78, and DHR96 transcription 

during the onset of metamorphosis. Northern blots containing RNA samples isolated from 
staged third instar larvae and prepupae collected at 2 hr intervals were probed to detect DHR38, 
DHR78, and DHR96 mRNAs. These blots have been used previously for detailed studies of 
20E-regulated gene transcription ((Andres, A. J., Fletcher, J. C, Karim, F. D. & Thummel, C. S. 

30 (1993). Dev. Biol. 160, 388-404) One set of blots was sequentially stripped and hybridized with 
probes from each gene, in order to allow direct comparison of transcription patterns. The blots 
were also hybridized to detect rp49 mRNA, as a control for equal loading (data not shown)). 
Developmental times are shown at the top as hours after egg laying for third instar larval 
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development, and as hours after puparium formation for prepupal and pupal development. 
Landmark 20E-triggered developmental transitions are shown at the top. 

1 1 . Figure 8 shows a time course of DHR38, DHR78, and DHR96 transcription in 
cultured larval organs treated with 20E. Mass-isolated late third instar larval organs were treated 

5 with 5x10-7 M 20E for the times shown, as described (Thummel, C. S., Burtis, K. C. & 

Hogness, D. S. (1990). Cell 61, 101-1 1 1) Equal amounts of total RNA isolated from each time 
point were fractionated by formaldehyde agarose gel electrophoresis, transferred to a nylon 
membrane, and hybridized with probes to detect DHR38, DHR78, DHR96 and rp49 mRNA. 
One northern blot was sequentially stripped and hybridized with a probe from each gene, in order 

10 to allow direct comparison of transcription patterns. Detection of DHR38 transcripts required 
the use of an anti sense RNA probe. 

12. Figure 9 shows the DNA-binding specificities of DHR38, DHR78, and DHR96 
protein. Each protein was overproduced in E. coli, purified, and tested for its ability to bind to 
eight oligonucleotides using electrophoretic mobility shift assays. The names of each 

15 oligonucleotide are shown at the top. In all cases, binding could be competed by the addition of 
an excess of the appropriate unlabelled oligonucleotide (data not shown). 

13. Table 3 shows DHR96 regulates genes involved in detoxification. Control larvae and 
larvae carrying a hs-DHR96 transgene were heat-shocked at 6 hours before pupariation and 
collected as white prepupae. Total RNA was extracted and purified to allow probe generation. 

20 Afrymetrix microarray chips were hybridized with the probes and scanned. Raw data was 

analyzed with dCHIP, and filtering was performed in MS ACCESS. This table depicts the top 
20 genes that are reduced in their expression in hs-DHR96 transformants compared to control 
animals. The corresponding fold change is also shown. Red: Members of gene families known 
to be involved in detoxification in insects. 

25 IV. DETAILED DESCRIPTION 

14. Before the present compounds, compositions, articles, devices, and/or methods are 
disclosed and described, it is to be understood that they are not limited to specific synthetic 
methods or specific recombinant biotechnology methods unless otherwise specified, or to 
particular reagents unless otherwise specified, as such may, of course, vary. It is also to be 

30 understood that the terminology used herein is for the purpose of describing particular 
embodiments only and is not intended to be limiting. 
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A. Definitions 

1 5. As used in the specification and the appended claims, the singular forms "a," "an" and 
"the" include plural referents unless the context clearly dictates otherwise. Thus, for example, 
reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers, and the 

5 like. 

16. Ranges can be expressed herein as from "about" one particular value, and/or to 
"about" another particular value. When such a range is expressed, another embodiment includes 
from the one particular value and/or to the other particular value. Similarly, when values are 
expressed as approximations, by use of the antecedent "about," it will be understood that the 

10 particular value forms another embodiment. It will be further understood that the endpoints of 
each of the ranges are significant both in relation to the other endpoint, and independently of the 
other endpoint. It is also understood that there are a number of values disclosed herein, and that 
each value is also herein disclosed as "about" that particular value in addition to the value itself. 
For example, if the value "10" is disclosed, then "about 10" is also disclosed. It is also 

15 understood that when a value is disclosed that "less than or equal to" the value, "greater than or 
equal to the value" and possible ranges between values are also disclosed, as appropriately 
understood by the skilled artisan. For example, if the value "10" is disclosed the "less than or 
equal to 10"as well as "greater than or equal to 10" is also disclosed. It is also understood that 
the throughout the application, data is provided in a number of different formats, and that this 

20 data, represents endpoints and starting points, and ranges for any combination of the data points. 
For example, if a particular data point "10" and a particular data point 15 are disclosed, it is 
understood that greater than, greater than or equal to, less than, less than or equal to, and equal to 
10 and 15 are considered disclosed as well as between 10 and 15. 

17. References in the specification and concluding claims to parts by weight, of a 

25 particular element or component in a composition or article, denotes the weight relationship 

between the element or component and any other elements or components in the composition or 
article for which a part by weight is expressed. Thus, in a compound containing 2 parts by 
weight of component X and 5 parts by weight component Y, X and Y are present at a weight 
ratio of 2:5, and are present in such ratio regardless of whether additional components are 

30 contained in the compound. 

1 8. A weight percent of a component, unless specifically stated to the contrary, is based 
on the total weight of the formulation or composition in which the component is included. 
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19. In this specification and in the claims which follow, reference will be made to a 
number of terms which shall be defined to have the following meanings: 

20. "Optional" or "optionally" means that the subsequently described event or 
circumstance may or may not occur, and that the description includes instances where said event 

5 or circumstance occurs and instances where it does not. 

21 . "Primers" are a subset of probes which are capable of supporting some type of 
enzymatic manipulation and which can hybridize with a target nucleic acid such that the 
enzymatic manipulation can occur. A primer can be made from any combination of nucleotides 
or nucleotide derivatives or analogs available in the art which do not interfere with the enzymatic 

10 manipulation. 

22. "Probes" are molecules capable of interacting with a target nucleic acid, typically in a 
sequence specific manner, for example through hybridization. The hybridization of nucleic acids 
is well understood in the art and discussed herein. Typically a probe can be made from any 
combination of nucleotides or nucleotide derivatives or analogs available in the art. 

15 23. Throughout this application, various publications are referenced. The disclosures of 

these publications in their entireties are hereby incorporated by reference into this application in 
order to more fully describe the state of the art to which this pertains. The references disclosed 
are also individually and specifically incorporated by reference herein for the material contained 
in them that is discussed in the sentence in which the reference is relied upon. 

20 B. Compositions and methods 

24. Higher organisms neutralize environmental toxins or xenobiotics through enzymes 
that include cytochrome p450 monooxygenases, glutathione transferases, carboxylesterases, and 
UDP-glucuronosyl transferases. In mammals, some of these detoxification enzymes are directly 
regulated by the nuclear receptors PXR and CAR, which in turn are activated by a broad 

25 spectrum of xenobiotics including prescription drugs, plant toxins and other contaminants. In 
contrast, there is little understanding of how similar xenobiotic responses might be controlled in 
insects. Herein it is shown that mutants in the DHR96 nuclear receptor of Drosophila are viable 
and fertile under standard laboratory conditions, as are flies that widely express double stranded 
DHR96 RNA (RNAi) from a transgene. However, when exposed to a pesticide like DDT, 

30 mutant animals are less resistant to the insecticide challenge, dying more rapidly and at lower 
concentrations than control animals. Unlike many other nuclear receptors, widespread ectopic 
expression of DHR96 has no effect on the viability of larvae or flies, suggesting that activation 
of DHR96 is ligand-dependent. Disclosed herein, DHR96 is expressed in tissues that have been 



— 5 — 



Attorney Docket Number 21101 .0053 U 1 

associated with the detoxification process, including the gastric caeca, the major site of 
absorption in Diptera, and the fat body, the insect equivalent of the liver. Microarray studies 
disclosed herein show that overexpression of DHR96 results in the downregulation of members 
of all four classes of the detoxification machinery, supporting the proposal that DHR96 functions 
5 as a xenobiotic regulator in Drosophila. These findings demonstrate how detoxification 

enzymes are activated in insects upon challenge with an insecticide. Given that this receptor has 
been highly conserved in the distant insect species, Anopheles gambiae, it is likely that it exerts a 
similar function in all insects. Also disclosed are methods for the identification of specific 
compounds or peptides that affect DHR96 activity and can act as effective synergists that, for 
1 0 example, enhance the lethality of pesticides for insect control. 

25. Disclosed are mutants of the DHR96 gene which have reduced DHR96 activity in the 
xenobiotic pathway. These mutants can be used in a variety of methods for isolating new 
molecules that inhibit the xenobiotic pathway, by for example, being used as controls in methods 
that are testing the xenobiotic activity of a particular compound. The mutants can also be used 

1 5 as stock for production of other mutant flies. The mutants can also be used as seed genetic 
backgrounds to change a given population of flies to insecticide sensitive flies, by introducing 
the mutant backgrounds into the populations, through fly breeding. 

26. Also disclosed are compositions which are capable of inhibiting DHR96 protein 
function or gene function, and which in turn inhibit the xenobiotic effect of the DHR96 protein. 

20 For example, disclosed are iRNA molecules which inhibit the function of DHR96 and inhibit the 
xenobiotic effect of DHR96. 

27. Also disclosed are methods of inhibiting insect growth by administering an inhibitor 
of DHR96 to an insect, such as a fly. 

28. Also disclosed are methods of identifying molecules that inhibit DHR96, and inhibit 
25 the xenobiotic activity in an insect, such as a fly, comprising for example, testing compounds for 

inhibition activity of DHR96 and/or inhibition of xenobiotic activity and, then for example, 
comparing the activity of these molecules to the disclosed inhibitors of DHR96, such as the 
mutants or the disclosed iRNA molecules. 

1. The xenobiotic response 
30 29. Virtually every organism faces a fundamental challenge when exposed to potentially 

harmful environmental substances called xenobiotics, which may include pharmaceuticals, plant 
toxins, pollutants, pesticides, hormones and fatty acids. Exposure to xenobiotics can occur either 
directly by physical contact, inhalation, or ingestion of nutrients or indirectly when an organism 
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generates toxic metabolites from less harmful precursors. The mechanisms by which toxic 
compounds are removed and/or neutralized fall into two broad categories. Usually as a result of 
extreme selective pressures, organisms may develop adaptive processes that are highly specific 
to a particular substance, as can be observed in many insect species that become resistant to 
5 pesticides (Wilson, T. G. (2001). Annu Rev Entomol 46, 545-571) or that have evolved the 
ability to utilize hazardous plant species as a food source (Danielson, P. B. et al. (1997). Proc 
Natl Acad Sci U S A 94, 10797-10802; Fogleman, J. C. (2000). Chem Biol Interact 125, 93- 
105.). In contrast to this highly specific response, all metazoan species appear to have a general 
machinery that allows the efficient detoxification of a vast range of chemicals. The general 

10 detoxification mechanisms display a surprising flexibility, which is mainly achieved by two 
factors. First, at least three enzyme classes comprising more than 160 proteins in the mosquito 
and the fruit fly are responsible for metabolizing lipophilic toxins into less harmful substances 
(Ranson, H., et al. (2002). Science 298, 179-181). Second, some enzymes appear to have an 
immense range of substrate specificity. For instance, Cyp3 A4, a member of the cytochrome 

15 p450 monooxygenase family, is capable of neutralizing an estimated 50% of all existing 

prescription drugs (Maurel, P. (1996). (Boca Raton, CRC Press), pp. 241-270). Cytochrome 
p450 enzymes are often referred to as phase I enzymes, because they catalyze the first step in the 
detoxification process by adding oxygen groups to lipophilic chemicals, thus resulting in more 
water-soluble compounds, which in turn facilitates efficient excretion. Other enzyme families 

20 like glutathione transferases, carboxylesterases and UDP-glucuronosyl transferases are classified 
as phase II enzymes, as their role is to catalyze subsequent detoxification steps. 

30. In insects, pesticide resistance is most often the result of mutations that affect the 
general detoxification pathway. For example, the overexpression of a single gene, Cyp6gl, a 
member of the cytochrome p450 family, is sufficient to confer DDT resistance in Drosophila 

25 melanogaster (Daborn, P. B. et al. (2002), Science 297, 2253-2256). The same study 

demonstrated that Cyp6gl is hypertranscribed in over 20 DDT-resistant Drosophila strains of 
' worldwide origin, but further analysis suggested that this finding could be traced back to a single 
event, since all alleles harbor the same Accord transposon in their 5' regulatory region. 

31. In the past decade considerable progress in the field has revealed the mechanisms that 
30 allows an organism to sense a wide range of toxic substances and to understand how xenobiotic 

sensing translates into the induction of highly specific sets of detoxifying enzymes. It quickly 
became apparent that certain members of the so-called nuclear receptor superfamily are the 
central players in this process. Nuclear receptors are ligand-activated transcription factors that 
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play important roles in diverse physiological processes such as cell growth and differentiation, 
embryonic development, and cholesterol metabolism (Francis, G. A. etal. (2003) Annu Rev 
Physiol 65, 261-31 1; Mangelsdorf, D. J.,et al. (1995). Cell 83, 835-839; Tontonoz, P., and 
Mangelsdorf, D. J. (2003). Mol Endocrinol 17, 985-993) Of the 48 nuclear receptors encoded 
5 by the human genome -26 have identified ligands (Kliewer, S. A. (2003) J Nutr 1 33, 2444S- 
2447S), but only three have been associated with xenobiotic activity, namely PXR, CAR and 
VDR (Maglich, J. M.,etal. (2002) Mol Pharmacol 62, 638-646; Makishima, M., et al. 
(2002). Science 296, 1313-1316). These three closely related receptors are not only able to sense 
and bind lipophilic xenobiotic substances directly, but once activated by such a ligand, they can 
10 regulate the expression of enzymes that will neutralize the very compound that had activated 
these nuclear receptors in the first place, thus creating feedback loop. Disclosed is an analogous 
mechanism that exists in the fruit fly, Drosophila melanogaster. The disclosed mechanism 
involves an insect nuclear receptor, the Drosophila DHR96 nuclear receptor. 

(1) Nuclear receptors 

1 5 32. Members of the nuclear receptor superfamily have been one of the most productive 

targets for drug development by the pharmaceutical industry. Efforts along these lines have 
resulted in drugs that have had a major impact on human health, including cancer treatments, 
fertility control, and cholesterol reduction. Nuclear receptors are ligand-activated transcription 
factors, but can have many regulatory functions aside from this ligand activated function. 

20 Nuclear receptors have been organized in a phylogeny-based nomenclature (Nuclear Receptors 
Nomenclature Committee, (1999) Cell 97, 1-3.) of the form NRxyz, where x is the sub-family, y 
is the group and z the gene. For a review see, Robinson-Rechavi, M., et al., Journal of Cell 
Science, Cell Science at a Glance, 1 16(4):585-586 and poster insert, (2003), which is herein 
incorporated by reference at least for material related to nuclear receptors). 

25 33. Nuclear receptors lend themselves to drug intervention because their activity can be 

modulated by small lipophilic compounds that can be easily delivered to animals in a stable 
format. Compounds can be developed that either constitutively activate their cognate receptor, 
called agonists, or constitutively inactivate the receptor, called antagonists. The use of these 
compounds in animals provides a means of tightly regulating nuclear receptor activity in vivo, 

30 with resultant effects on growth and development. 

34. Surprisingly, no similar effort has been made by the agricultural industry to target 
insect nuclear receptors as a means of pest control. This is largely because the mechanism of 
action of most insect nuclear receptors has remained undefined. Disclosed herein it was shown 



— 8 — 



Auomcy Dockci Number 21 101.0053UI 

that an insect nuclear receptor, encoded by DHR96, is required for resistance to toxic compounds 
in Drosophila. Also disclosed are molecules that inhibit the DHR96 function and that inhibiting 
the function of DHR96 makes DHR96 have decreased resistance to pesticides and toxins. Also 
disclosed are methods utilizing DHR96 to identify compounds that modulate its function, such 
5 as inhibit its function. Molecules that inhibit DHR96 render the insect more susceptible and 
sensitive to pesticides. 

35. The Drosophila genome encodes 18 nuclear receptors that have a classical DNA- 
binding and ligand-binding domain and, of those, just two have identified Iigands. In the 
nematode C elegans, it was shown that a mutation in the nuclear receptor nhr-8 gene causes a 

10 reduced resistance to colchicine and chloroquine, suggesting that this gene is involved in the 
xenobiotic pathway (Lindblom, T. H., et al. (2001). Curr Biol 1 1 , 864-868, which is herein 
incorporated by reference at least for material related to nuclear receptors and their activity, and 
for material related to NHR8). Disclosed herein DHR96 mutants are viable under normal 
conditions, but exhibit a significantly lower resistance to DDT when compared to wild type flies. 

15 Additionally, microarray analysis of animals that overexpress DHR96 indicate that this nuclear 
receptor regulates genes which primarily encode detoxification enzymes. 

36. Disclosed herein insecticide function in insects can be reviewed from a different 
perspective. Disclosed are methods for identifying DHR96 antagonists and agonists. Also 
disclosed are methods related to the identification of the DHR96 target gene network. Also 

20 disclosed is a class of pesticides that targets the regulatory pathways that control the 
detoxification machinery. 

(a) Classes of nuclear receptors 

37. Retinoid, vitamin D, steroid, and thyroid hormones are small hydrophobic Iigands 
that initiate a diverse array of developmental and metabolic responses. The receptors that 

25 mediate these responses form the basis of the nuclear hormone receptor superfamily (see Tsai, 
M.-J. & O'Malley, B. W. (1994). Annu. Rev. Biochem. 63, 451-486, for a review). This family is 
defined by a characteristic protein domain structure including a conserved DNA-binding domain 
and a ligand binding/dimerization domain. Members of this superfamily can be divided into 
three classes based on their ligand-binding and DNA-binding properties. Steroid receptors, 

30 including the estrogen and glucocorticoid receptors, form homodimers that bind to an inverted 
repeat of 6 bp consensus half-sites (Tsai, M.-J. & O'Malley, B. W. (1994). Annu. Rev. Biochem. 
63, 451-486, Gronemeyer, H. (1992). FASEBJ. 6, 2524-2529). The second class includes the 
retinoid receptors, RAR and RXR, as well as receptors for thyroid hormone and vitamin D. 
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These receptors can bind to direct repeats of AGGTCA half-sites as homodimers or heterodimers 
(Stunnenberg, H. G. (1993). BioEssays 1 5, 309-315). The third and largest class are referred to 
as orphan receptors since their potential ligands are unknown. At least some of these receptors, 
including Rev-Erb and NGFI-B, can bind to a single AGGTCA half-site (Harding, H. P..& 
5 Lazar, M. A. (1993). Mol Cell Biol 13, 31 13-3121; Wilson, T. E., et al., (1993). Mol Cell Bio. 
13, 5794-5804). Although extensive studies have provided significant insights into the 
mechanisms by which nuclear hormone receptors regulate the transcription of target genes, we 
still know little about how these changes in gene expression result in specific and diverse 
developmental responses. 

10 (b) Drosophila nuclear receptors 

38. There are 18 canonical nuclear receptor genes in the complete genome of the fly 
Drosophila melanogaster (Adams et al., (2000) Science 287, 2185-2195, which is herein 
incorporated by reference at least for material related to nuclear receptors). The 18 members of 
the nuclear hormone receptor superfamily identified in Drosophila are: EcR, usp, til (Pignoni, 

15 F., et al., (1990). Cell 62, 1 51-163), svp (Mlodzik, M., et al., (1990). Cell 60, 21 1-224) , dHNF-4 
( Zhong, W., et al., (1993). EMBOJ 12, 537-544), E75 (Segraves, W. A. & Hogness, D. S. 
(1990). Genes Dev. 4, 204-219), E78 (Stone, B. L. & Thummel, C. S. (1993). Cell 75, 307-320), 
FTZ-F1 (Lavorgna, G., et al., (1991). Science 252, 848-851), DHR3 (Koelle, M. R., et al., 
(1992). Proc. Natl Acad. Sci. USA 89, 6167-6171), DHR4 (Weller J, Sun GC, Zhou B, Lan Q, 

20 Hiruma K, Riddiford LM. Isolation and developmental expression of two nuclear receptors, 
MHR4 and betaFTZ-Fl, in the tobacco hornworm, Manduca sexta. Insect Biochem Mol Biol. 
2001 Jun 22;31(8):827-37.; King-Jones, K. Charles, J.-P., & C.S. Thummel, The DHR4 orphan 
nuclear receptor is required for Drosophila growth and metamorphosis, manuscript in prep; 
Adams et al., (2000) Science 287, 21 85-2195) and DHR39 (Ohno, C. K. & Petkovich, M. 

25 (1992). Mech. Dev. 40, 13-24; Ayer, S., et al., (1993). Nuc. Acids Res. 21, 1619-1627), DHR38, 
DHR78 (Fisk and Thummel, (1995), PNAS, Proc Natl Acad Sci USA. 1995 Nov 
7;92(23): 10604-8), DHR83 (King- Jones, K. and C.S. Thummel (2003) Drosophila nuclear 
receptors. In"Handbook of Cell Signaling," Vol. 3, (Bradshaw, R. and Dennis, E., 
eds.),Academic Press, New York, pp. 69-73; Adams et al., (2000) Science 287, 2185-2195), 

30 DHR96 (Fisk and Thummel, 1993), rfs/(Finley, K. D., et al. (1998). "dissatisfaction encodes a 
Tailless-like nuclear receptor expressed in a subset of CNS neurons controlling Drosophila 
sexual behavior." Neuron 21, 1363-1374), dERR (King- Jones, K. and C.S. Thummel (2003) 
Drosophila nuclear receptors. In"Handbook of Cell Signaling," Vol. 3, (Bradshaw, R. and 
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Dennis, E., eds.),Academic Press, New York, pp. 69-73; Adams et al., (2000) Science 287, 
2185-2195), and dFAX-1 (King-Jones, K. and C.S. Thummel (2003) Drosophila nuclear 
receptors. In"Handbook of Cell Signaling/' Vol. 3, (Bradshaw, R. and Dennis, E., 
eds.),Academic Press, New York, pp. 69-73; Adams et al., (2000) Science 287, 2185-2195) At 
least seven of these genes appear to contribute to the 20E regulatory hierarchies that direct the 
onset of metamorphosis - E75, E78,fiFTZ-Fl, DHR3, DHR39, EcR, and usp (Richards, G. 
(1992). Current Biology 2, 657-659; Horner, M., et al., (1995). Dev. Biol. 168, 490-502; 
Woodard, C. T., et al., (1994). Cell 79, 607-615). 

39. Table 5 provides a list of Drosophila nuclear receptors. 

40. Table 5 



probe set CG 



CT 



Accession 



144004_at CG16902 CT37504 FBgn0023546 

154699_at CG4059 CT13432 FBgnOO01078 

!43123_at CGI 1823 CT11367 FBgn0000448 

152580_at CG11783 CT33046 FBgn0015240 

143535_at CG9310 CT40906 FBgn0004914 

143768_at CGI 864 CT5732 FBgn0014859 

149398_at CG10296 CT2891 1 FBgn0037436 

143372_at CGI 1502 CT12919 FBgn0003651 

143379_at CG1378 CT3134 FBgn0003720 

143805_at CG9019 CT25922 FBgn0O15381 

147244_at CG 16801 CT37351 FBgn0034012 

153072_at CG7404 CT22787 FBgn0035849 

I52160_at CG7199 CT22217 FBgnOOI5239 

I53675_at CG4380 CT14272 FBgn0O03964 



Description 

sym=Hr4 

orEG:133E12.2 

/name= DHR4 

sym=ftz-fl /name=ftz 

transcription factor 1 

sym=Hr46 or DHR3 

/name=Hormone receptor-like 

in 46 

sym=Hr96 or 
DHR96/name=Hormone 
receptor-like in 96 
sym=Hnf4 
/name=Hepatocyte 
nuclear factor 4 
sym=Hr38 or DHR38 
/name=Hormone receptor-like 
in 38 

sym=CG 10296 or 
DHR83 /name-Hr83 
sym=svp /name=seven up 
/prod=nucIear receptor 
NR2F3 

sym=tll /name=tail!ess 
/prod=nuc!ear receptor 
NR2E2 
sym=dsf 

/name=dissatisfaction /prod=» 
/func=receptor 

sym=CG 16801 /name-FAX-l 
/prod=n uc I ear hormone 
receptor-like 

sym=CG7404 /name=ERR 
/prod 55 /func=steroid hormone 
receptor 
sym=Hr78 or 
DHR78/name=Hormone- 
receptor-like in 78 
sym=usp /name=ultraspiracle 
/prod=nuclcar receptor 
NR2B4 



SEQIDNO 

SEQ ID NO 1 
SEQlDNO:3 

SEQ ID NO: 5 

SEQ ID NO: 7 

SEQ ID NO: 9 

SEQIDNO: 
11 

SEQIDNO: 
13 

SEQIDNO: 
15 

SEQIDNO: 
17 

SEQ ID NO: 
19 

SEQIDNO: 
21 

SEQIDNO: 
23 

SEQIDNO: 
25 

SEQIDNO: 27 
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154377_at CGI 765 CT5200 FBgnO0OO546 



I43525_at CGI8023 CT40336 FBgn0004865 



I53197_at CG8127 CT24290 FBgn000056S 



155094_at CG8676 CT5296 FBgn0010229 
41. 



sym=Eip75B or 
E75/name=Ecdysone- induced 
protein 75 B 
sym=Eip78C or 
E78/name=Ecdysone- induced 
protein 78C 

sym=EcR /name=Ecdysone 
receptor /prod=ecdysone 
receptor 

sym=EcR /name=Ecdysone 
receptor /prod=ecdysone 
receptor 



SEQ ID NO: 35 



SEQIDNO: 33 



SEQ ID NO: 29 



SEQIDNO: 31 



42. While there are 1 8 nuclear receptors in flies, there are 48 in humans (Robinson- 
Rechavi et al., (2001) Trends Genet 17, 554-556), 49 in the mouse with the addition of FXRp, 
(Robinson-Rechavi and Laudet, 2003, Methods Enzymol. 2003;364:95-1 18) and more than 270 
5 genes in the nematode worm Caenorhabditis elegans (Sluder et al, (1999). Genome Research 9, 



43. 20E is involved in the metamorphosis of the fruit fly, Drosophila melanogaster 
through steroid hormone receptors. A high titer 20E pulse at the end of third instar larval 

10 development triggers puparium formation, followed 10 hrs later by an 20E pulse that triggers 
head eversion and the onset of pupal development (Pak, M. D. & Gilbert, L. L (1987). /. Liq. 
Chrom. 10, 2591-261 1; Richards, G. (1981). Mol. Cell. Endocrin. 21, 181-197). The 20E 
receptor is encoded by two members of the nuclear hormone receptor superfamily, EcR (Koelle, 
M. R., et al, (1991). Cell 67, 59-77) and usp ( Henrich, V. C, et al, (1990). Nuc. Acids Res. 18, 

15 4H3-4148; shea, M. J, et al, (1990). Genes Dev. 4, 1 128-1 140; Oro, A. E, et al, (1 990). 
Nature 347, 298-301). Usp is most closely related to the vertebrate RXR family and can 
heterodimerize with vertebrate thyroid and vitamin D receptors, as well as with EcR (Y ao, T, et 
al, (1992). Cell 71, 63-72; Thomas, H. E., et al, (1993). Nature 362, 471-475; Yao, T, et al, 
(1993). Nature 366, 476-479; Koelle, M. R. (1992) Ph.D. thesis, Stanford University). The 

20 ability of RXRs to function as promiscuous heterodimerization partners combined with the 
sequence similarity of many receptor binding sites raises the possibility that other members of 
the superfamily may function in transducing 20E signals, either by interacting directly with EcR 
and/or Usp, or by competing for receptor binding sites (Richards, G. (1992). Current Biology 2, 
657-659). 



103-120. 



(c) Role of 20-hydroxyecdysone(20E) in Drosophila 
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(d) General structure of nuclear receptors 

44. There are a number of domains in a nuclear receptor. From the N terminus to the C 
terminus there is the A/B domain, followed by a DNA binding domain (DBD, C), which 
contains the DNA sequence recognition domain called the P-box, which is followed by a less 

5 conserved region, D, which acts as a flexible hinge between the DBD and the ligand binding 
domain (LBD, £) and the D domain typically contains the nuclear localization signal, but this 
may overlap with the C domain, and finally some nuclear receptors contain a C-terminal F 
domain whose function is unknown. 

45. The A/B domain and N terminal region in general is highly variable and can range in 
10 size from less than about 50 amino acids to more than about 500 amino acids. The A/B domain 

typically contains the transactivation domains which typically include at least one constitutively 
active domain, the AF-1 domain, and than typically one or more autonomous activation domains 
which can be regulated or not, called AD domains. 

46. The DBD is typically the most conserved region. It contains the P-box, a six amino 
15 acid region that confers specificity for binding to particular target sites in the DNA. The P-box 

for DHR96 is ESCKA. An example of DHR96 is shown in SEQ ID NO:7. The DBD is also 
typically the site of homo- and hetero- dimerization. The 3D structure of the DBD shows that it 
contains contains two highly conserved zinc- fingers - C-X2-C-X13-C-X2-C and CX5- C-X9-C- 
X2-C - the four cysteines of each finger chelating one Zru+ ion. 
20 47. The LBD is typically the largest domain and is only moderately conserved, but the 

secondary structure is often conserved and contains 12 a-helixes. Many functions are associated 
with the E domain, including the AF-2 transactivation function, a strong dimerization interface, 
another NLS, and often a repression function. Typically the functions are ligand regulated. 

(e) Dimerization of nuclear receptors. 

25 48. Dimerization of nuclear receptors is very important to their function. The 

dimerization domains typically reside in the DBD and LBD. Many nuclear receptors 
heterodimerize with RXRs (USP in arthropods), such as DHR38 (NR4A4), NGFIB (NR4A1), 
NURR1 (NR4A2), NOR1 (NR4A3), LXR and FXR subfamilies (LXRo, (NR1H3), LXR/J 
(NR1H2, HO), ECR (NR1H1), FXRa(NRlH4, HO), FXRjS (NR1H5, HO), the CAR1 and VDR 

30 subfamilies including, CAR1 (NR1I3), PXR (NR1I2), VDR (NR1L1) (NR1 Jl), the PPAR 
subfamily including, PPAR 7 (NR1C3), PPARa(NRlCl), AND PPAR/3 (NR1C2), the RAR 
subfamily including RAR/? (NR1B2), RARa (NR1B1), and RAR7(NR1B3), and TRa(NRlAl), 
and TR0 (NR1A2), and possibly COUP-TF and FXRp (for a review, see Robinson-Rechavi M, 
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Escriva Garcia H, Laudet V., J Cell Sci. 2003 Feb l5;M6(Pt 4):585-6). DHR96 can also be 
found to dimerize with any other receptor, such as USB, or itself. 

(f) Ligands for nuclear receptors 

49. The superfamily includes receptors for many different types of molecules. For 
5 example, nuclear receptors bind hydrophobic molecules such as steroid hormones, such as 

estrogens, glucocorticoids, progesterone, mineralocorticoids, androgens, vitamin D3, ecdysone, 
oxysterols and bile acids. Certain nuclear receptors also bind retinoic acids , such as all-trans 
and 9-cis isoforms, thyroid hormones, fatty acids, leukotrienes and prostaglandins (Escriva et al., 
2000, Bioessays 22, 717-727 and Robinson-Rechavi M, Escriva Garcia H, Laudet V., J Cell Sci. 
10 2003 Feb 15;1 16(Pt 4):585-6). 

(g) How nuclear receptors function 

50. Nuclear receptors typically act in a stepwise fashion that starts with repression, moves 
to a state of derepression, and ends with transcription activation, (reviewed by Robinson- 
Rechavi M, Escriva Garcia H, Laudet V., J Cell Sci. 2003 Feb 15;1 16(Pt 4):585-6). 

15 51. Repression typically occurs with compressors, such as the histone deacetylase activity 

(HDAC) (for example, the apo-nuclear receptor). Usually ligand binding results in derepression, 
caused by the disassociation of the receptor from the compressors. Also ligand binding typically 
causes the recruitment of coactivators, such as histone acetyltransferase (HAT) activity, which 
causes chromatin decondensation, which is believed to be necessary but not sufficient for 

20 activation of the target gene. After the HAT complex dissociates, typically a second coactivator 
complex is assembled (TRAP/DRIP/ARC), which is able to establish contact with the basal 
transcription machinery, and thus results in transcription activation of the target gene, but many 
other transcription co-activators can be associated with the nuclear receptor and these 
coactivators can provide activation discrimination. This general scheme does not apply for all 

25 nuclear receptors, as for example, some nuclear receptors can activate without ligand and some 
may bind DNA without ligand and some may repress with or without ligand. 

(2) DHR96 gene 

52. DHR96 maps to 96B 12-14 in the polytene chromosomes of Drosophila. The DHR96 
gene was cloned and sequenced and its sequence is set forth in SEQ ID NO: 1 . (Fisk and 

30 Thummel (1995) Proc. Natl. Acad. Sci USA, 92: 10604-10608, herein incorporated by reference 
at least for material related to the DHR96 gene and its sequence including the specific sequence). 

53. DHR96 is highly conserved in Anopheles gambiae, a distant (~ 250 M years) dipteran 
species (see Table 4). Similarly, many other Drosophila nuclear receptors are conserved in even 
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more distant insects and, when examined, their regulatory functions appear to be conserved as 
well (Swevers L, latrou K. The ecdysone regulatory cascade and ovarian development in 
lepidopteran insects: insights from the silkmoth paradigm. Insect Biochem Mol Biol. 2003 
Dec;33(l 2): 1285-97; Riddiford LM, Hiruma K, Zhou X, Nelson CA. Insights into the molecular 
5 basis of the hormonal control of molting and metamorphosis from Manduca sexta and 
Drosophila melanogaster. Insect Biochem Mol Biol. 2003 Dec;33( 12): 1327-38). This is 
consistent with the role of detoxification via DHR96 being conserved through evolution. Thus, 
inactivation of DHR96 function in known insect pests provides a novel mode of intervention. It 
is understood that DHR96 homologs in other insects, insect orders, insect families and other 
10 insect specifies are considered disclosed and that they function in a manner similar to DHR96 in 
Drosophila. There is significant homology within the order Diptera and within the class of 
insects in general for nuclear receptors, and there is shown in Table 4, that there is a high degree 
of homology between DHR96 in other insects, such as the mosquito. 

54. Disclosed are DHR96 variants that have at least 60%, 65%, 70%, 75%, 80%, 81%, 
15 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 91%, 92%, 93%, 94%, 95%, 96%, 

97%, 98%, or 99% identity or homology as discussed herein in to the LBD of DHR96, DBD of 
DHR96, or full length DHR96, or of fragments of DHR96, functional or otherwise. 

55. Among the C. elegans receptors, DHR96 is most similar to DAF-12, which is a gene 
involved in dauer larva formation in C. elegans (68% identity DBD; 29% identity LBD). The 

20 match with NHR-8 in C, elegans is weaker (60%; 25%). This is consistent with DHR96 having 
a role similar to DAF-12. DAF-1 2 reads signals from TGFbeta and insulin and decides when the 
worm should enter diapause to survive difficult conditions. Diapause is similar to pupal stages 
in many ways (indeed many insects diapause during metamorphosis). Disclosed herein, mutants 
of DHR96 did not have any effects on metamorphosis - and they survived. Thus it was expected 

25 that DHR96 would have a function similar to DAF-12. DAF-12 is a gene involved in dauer 
larva formation in C. elegans. DAF-12 reads signals from TGFbeta and insulin and decides 
when the worm should enter diapause to survive difficult conditions. Diapause is similar to 
pupal stages in many ways (indeed many insects diapause during metamorphosis). However, as 
disclosed herein, mutants of DHR96 did not have any effects on metamorphosis — as they 

30 survived. 

56. Disclosed are systems that assay for effects of drugs that alter DHR96 - and thus one 
can assay for effects on target gene transcription and relate that expression to the ability of an 
animal, such as an insect, to resist toxins. 
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57. Table 4 

LBD amino 





DBD amino acids 




acids 501 


-7ZJ 


species 


7-72 identity 


p-box 


identity 




anopholes gambiae 


86% 


same 




65% 




% 






% 


c.elegans daf-12 


69% 


same 




26% 


strongyloides stercoralis-parasitic 










worrn 


o/vo 


different 




Z/ v /Q 


c.elegans nhr-48 


bovb 
% 


same 






VDR-zebrafish 


65% 


different 




27% 


VDR-bastard halibut 


63% 


different 




27% 


mouse vdr 


62% 


different 




23% 


human vdr 


62% 


different 




24% 


c.elegans nhr-8 


60% 


same 




25% 


mouse pxr 


59% 


different 




23% 


human pxr 


59% 


different 




22% 


human car 


56% 


different 




19% 


AamEcRAl-tick 


54% 


different 






ecdysone receptor- locusta 










migratoris-locust 


53% 


different 






ecdysone receptor-calliphor viclna- 










insect 


53% 


different 






EcR- tenebrio molitor-yellow 










mealworm 


53% 


different 






EcR- d. melanogaster 


51% 


different 






EcR- aedes albopictus-mosquito 


51% 


different 






mouse car 


51% 


different 




20% 



58. 



59. Table 4 shows the percent identical amino acids within the DNA binding domain and 
ligand binding domain for DHR96 and the best matches in the public databases (Genbank). 
5 Shown is the mosquito DHR96 gene, and it is the orthologous receptor in mosquito, (anopholes 
gambiae) (85% and 65% identity - very high). Also listed is whether the sequence within the P 
box, is either the same as DHR96 or different. This sequence directs the DNA binding 
specificity of the receptor. DHR96 DNA binding is predicted to be similar to that of all three 
nematode homologs (daf-12, nhr-48 and nhr-8), but none of the vertebrate ones. 
10 60. In certain embodiments homologs of DHR96 in other insect species can have at least 

50% identity in the DBD and 25% identity in the LBD. 

61 . An alignment of the Drosophila nuclear hormone receptor DNA-binding domains 
reveals a central region of 8-9 unique amino acids flanked by highly conserved regions that each 
contain a C2C2 zinc finger (Fig. 5). 
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62. The DNA-binding domain of DHR96 is 64% identical to the human vitamin D 
receptor and 52% identical to EcR (Fig. 6C). The DHR96 ligand binding domain (amino acids 
501 - 723) is most similar to that of thyroid hormone receptor, with 23% identity. 

63. DHR96 encodes a 2.8 kb transcript that is expressed throughout third instar larval and 
5 prepupal development, with distinct increases in abundance at 106 hrs after egg laying (Fig. 7). 

The temporal patterns of DHR96 transcription most closely resemble those of the genes 
encoding the 20E receptor. EcR and usp mRNAs can be detected throughout third instar larval 
and prepupal development (Andres, A. J., et al., (1993). Dev. Biol. 160, 388-404; 36; Hcnrich, 
V. C, et al., (1994). Dev. Biol. 165, 38-52). 

10 64. The hsp27 EcRE is the only oligonucleotide bound by DHR96, albeit it a weak 

interaction (Fig. 9). The EcRE consists of a palindromic arrangement of the imperfect half-sites 
AGtgCA and gGtTCA. DHR78 and DHR96 recognize distinct sequences that can also be bound 
by the EcR/Usp heterodimer (Horner, M., et al., (1995). Dev. Biol. 168, 490-502). These distinct 
binding specificities are consistent with the P-box sequences of the DHR78 and DHR96 

15 proteins. The DHR78 P-box, EGCKG, like that of DHR38, directs binding to an AGGTCA half- 
site sequence (Tsai, M.-J. & O'Malley, B. W. (1994). Annu. Rev. Biochem. 63, 451-486). In 
contrast, DHR96 contains a unique P-box sequence that is only present in its three C. elegans 
homologs (see Table 4 above) - ESCKA The binding of the hsp27 EcRE by DHR96 is very 
weak. An optimal DNA binding site can be identified by further experimentation. 

20 65. It will be of interest to determine whether DHR78 or DHR96 can heterodimerize with 

EcR, Usp, or any of the Drosophila orphan receptors. 

(a) DHR96 functions in the xenobiotic pathway 

66. Several lines of evidence support the conclusion that DHR96 acts in a xenobiotic 
pathway. First, the protein is selectively expressed in tissues involved in nutrient absorption 

25 (gastric cacae), metabolism (fat body), and excretion (Malpighian tubules) - tissues that should 
play a primary role in detoxification and elimination of both endobiotic and xenobiotic 
compounds. Second, DHR96 mutants, like null mutants in the mouse PXR and CAR xenobiotic 
nuclear receptors, are viable and fertile, indicating no critical role in normal development. Third, 
DHR96 mutants are more sensitive to the pesticide DDT. Fourth, the most highly repressed 

30 genes in response to DHR96 overexpression comprise members of all four classes of insect 
detoxifying genes. 

67. The effect of the mutants can be confirmed by the expression of wild type DHR96 
(from a heat-inducible DHR96 transgene, for example) in a homozygous mutant background, and 
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test for DDT sensitivity. This experiment should rescue the sensitivity back to wild type levels. 
In addition, DHR96 function was reduced by RNAi and this results in levels of DDT sensitivity 
that are similar to those oTDHR96 mutants. 

68. The decreased resistance to DDT in DHR96 mutants can be confirmed as related to 
5 the inability to neutralize toxins rather than a general lack of fitness by demonstrating that 

sensitivity of DHR96 mutants occurs for toxic compounds. It can also be confirmed by showing 
that detoxifying genes fail to be induced in DHR96 mutants treated with toxic compounds, by for 
example, microarray analysis, with the mutants in the presence or absence of a toxin. These 
results could be compared to the microarray data disclosed herein. Two toxins that could be 
10 used for this are DDT and phenobarbital because the latter was shown to induce a number of 
cytochrome P450 genes in Drosophila (Danielson, P. B. et al. (1998) Mol Gen Genet 259, 54- 
59). 

69. The expression of DHR96 and its activation level can be assayed to determine if it is 
directly activated by toxic compounds, similar to the ability of xenobiotics to bind to human 

15 PXR xenobiotic nuclear receptor. This can be done using transformed Drosophila that express a 
fusion of the yeast GAL4 DNA binding domain to the ligand binding domain of DHR96. When 
combined with a GAL4-dependent lacZ reporter gene, the expression of J3-galactosidase will 
only occur when the DHR96 ligand binding domain is in an active conformation. This could be 
caused by a direct interaction between DHR96 and the toxin. Larval organs that carry these 

20 constructs can be cultured in the presence of various xenobiotic inducers, testing for induction of 
lacZ reporter gene activity. Furthermore, target gene promoters can be identified which can also 
demonstrate a direct interaction between DHR96 and the expression of a detoxifying enzyme. 

70. In the disclosed microarray study, DHR96 was overexpressed and it was found that 
this resulted in repression of a significant number of members of the major detoxification gene 

25 families. Repression of cuticle proteins was also observed, consistent with a role for cuticle . 
formation in inhibiting pesticide toxicity (Wilson, T. G. (2001). Annu Rev Entomol 46, 545- 
571). The observation that these target genes are repressed suggests that DHR96 might function 
as a repressor in the absence of ligand. This is consistent with the action of other nuclear 
receptors, for example, both Endocrine receptor (EcR) and thyroid receptor (TR) are known to 

30 function in this manner. Very strict filtering criteria were used in the disclosed microarray 
experiments further strengthening the results. 

71 . The microarray studies allow the identification of the direct targets of DHR96. This 
will allow the identification of the genetic hierarchy that is regulated by this nuclear receptor. 
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Once target genes have been identified, it will be possible to construct reporter genes that are 
inducible by endogenous DHR96. Such a system can then be utilized to screen for drugs or 
combinations of drugs that activate or repress these reporter genes, in both a wild type and 
DHR96 mutant background. This can further confirm that DHR96 can directly regulate the 
5 expression of detoxifying genes. This system would also provide a direct readout of DHR96 
activity that would be useful for farther studies of DHR96 function and for the development of 
appropriate inhibitors of DHR96 function. The mutants of DHR96 can be used to identify and 
confirm other factors that can act as xenobiotic receptors in insects, and test whether these act in 
a partially redundant manner with DHR96. 
1 0 (3) Mutants of the DHR96 gene 

72. Various DHR96 mutant alleles were made. A series of studies to characterize the 
DHR96 mutant alleles were performed. These included Southern, Northern and Western 
blotting, tissue stains, sequencing of PCR products, and genetic mapping to validate the 
mutations in the different DHR96 alleles. Validation of these alleles was particularly important 

1 5 because flies homozygous for DHR96 mutations are viable and fertile. At least one of the alleles 
generated, DHR96 I6A , is a protein null, because the translation start site was deleted and no 
protein was detectable in Western blots or tissue stains of homozygous mutant animals. 

73. Gene targeting (Rong, Y. S., and Golic, K. G. (2000). Science 288, 2013-2018) 
was used to generate mutations in DHR96 because no deficiencies or P elements were known in 

20 this region of the genome, (see Example 1). Using these methods any mutations of the DHR96 
gene can be made, such as mutations at or around the start site; mutations at or around the splice 
sites; mutations which prevent or render inactive complete or partial exon sequences; mutations 
which render inactive or remove the complete or partial DBD or LBD or any of the domains of 
DHR96 discussed herein that it contains as a nuclear receptor. 

25 74. The DHR96 gene resides on the third chromosome. When mutations are made in 

certain embodiments the mutations of the DHR96 gene are made such that there is only a single 
copy of the mutant and no copies of the wildtype gene in the insect, such as the fly. This is done, 
for example, by using vectors for the mutation generation, which have sites built in that allow for 
recombination and excision of the site, and fly stocks containing a single copy can be selected. 

30 (see for example, Rong, Y. et al., (2002) Genes Dev 16, 1568-1581). 

75. Disclosed are null mutants of the DHR96 gene. A null mutant is defined herein as a 
mutant that lacks functional DHR96 protein product. 
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76. A null mutant disclosed herein is DHR96 I6A which is mutant having two specific 
deletions, one removing the start codon for translation and the second removing intron/exon 4, 
deleting a critical portion of the LBD. 

77. Another null mutant disclosed herein is the mutant DHR96 E25 which carries a tandem 
5 duplication of the DHR96 gene in place of the single wild type copy. One of these mutant 

DHR96 genes is identical to the DHR96 ,6A allele described above, missing both the start codon 
and intron/exon 4. The other mutant DHR96 gene is lacking only intron/exon 4. Western blot 
analysis indicates that both DHR96 E25 mutants, as well as DHR96 !6A mutants, produce no 
detectable DHR96 protein. Thus, both alleles can be considered as null mutations. 

10 78. One way to functionally test the mutants is in a viability assay based on different 

nutritional backgrounds. Disclosed herein, DHR96 mutants will have a decreased ability to grow 
on instant fly food, such as Carolina 424. If yeast is restored to the instant food, viability is 
restored to within wildtype levels, indicating that DHR96 mutants are sensitive to the absence of 
yeast in their food source. In contrast, mutants such as DHR96 E25 or DHR96 I6A are viable when 

15 grown on standard cornmeal medium. 

79. Disclosed are insects, such as flies, containing the mutant DHR96 gene, as well as 
any of their developmental stages, such as larvae, eggs, or pupae. These flies can be used, for 
example, to be crossed with other strains of flies to make new strains harboring the DHR96 
mutants. These strains could also be used, for example, as a type of insect inhibitor themselves, 

20 by being released in the wild to cross with wildtype insects creating mutant insects. For this 
purpose, mutations that create a dominant negative phenotype are preferred, such as those that 
have non-functional LBD, but retain their ability to heterodimerize, thus, interacting with and 
reducing the effect of native proteins in the insect. 

80. The disclosed mutants cause a decrease in the insect's ability to react to toxins or 
25 pesticides, such as DDT. The disclosed mutants, such as DHR96 I6A or DHR96 E2S insects, such 

as flies, were more sensitive to DDT and died at lower concentrations of DDT compared to 
control animals (Fig. 4A). In addition, when challenged with a fixed concentration of DDT, 
DHR96 homozygotes died more rapidly than wild type flies (Fig. 4B). 

81 . Also disclosed are mutants which have a defect in for example, activation with and 
30 without retention of dimerization ability, defects in ligand binding, and defects in DNA binding 

with and without loss of dimerization ability. 

82. Also disclosed are mutants that, when overexpressed, fail to modulate genes in the 
xenobiotic pathway, such as genes in the four major detoxification families, cytochrome P450s, 
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carboxylesterases, glutathione S-transferases, and UDP-glucuronosyltransferases (Oakeshott JG, 
Home I, Sutherland TD, Russell RJ. The genomics of insecticide resistance. Genome Biol. 
2003;4(1):202). In Table 3, two are P450s (Cyp genes), two are glutathione S-transferases , and 
one each of the carboxylesterases and UDP-glucuronosyltransferases were identified by 
5 microarray analysis. These represent the function of these proteins. Also denoted in Table 3 are 
the names of the genes. These are the gene names according to FlyBase 
(http://flybase.bio.indiana.edu/) They are either a proper name, like black or Lcpl, or the CG 
number, which is a numerical designation given to each fly gene. The CG number is usually used 
when the gene is new or of unknown function. This can be determined using microarrays as 
10 disclosed herein. 

(4) Compounds that modulate DHR96 activity 

83. Disclosed are compounds that modulate DHR96 activity. These compounds can, for 
example, modulate the activity of the protein through binding with the protein of DHR96, or 
through binding the mRNA of DHR96, and inhibiting the mRNA, through, for example, 

15 degradation or prevention of translation. The compositions can be any type of molecule, 

including, for example, proteins, small peptides, antibodies, functional nucleic acids, such as 
aptamers, antisense, ribozymes, dsRNA for RNAi or siRNA, or small molecules, such as those 
found in various combinatorial chemistry libraries or natural product libraries. 

84. For example, disclosed are compounds that function by, for example, binding to the 
20 ligand binding domain of DHR96 and inactivating its function or turning it into a constitutive 

repressor, or mimicking the normal cofactors that mediate nuclear receptor signaling to the 
general transcription machinery. These compounds, such as peptides, would render the receptor 
incapable of directing proper target gene transcription, blocking the detoxification response. The 
disclosed compounds can act in combination with known or any pesticide by increasing the 

25 effectiveness of the pesticide by decreasing the insect's ability to react to the pesticide. The 
compositions could be added to pre-existing pesticide formulations, increasing their 
effectiveness. Moreover, resistant lines of insects that respond poorly to a particular pesticide 
may be made more sensitive by adding compounds that affect DHR96 function. DHR96 is a 
target for pest control, capable of regulating insect populations. The compositions could also 

30 prevent or reduce the translation or expression of the DHR96 mRNA, by for example, through 
RNAi or antisense mechanisms. 
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(a) Functional Nucleic Acids 
85. Functional nucleic acids are nucleic acid molecules that have a specific function, such 
as binding a target molecule or catalyzing a specific reaction. Functional nucleic acid molecules 
can be divided into the following categories, which are not meant to be limiting. For example, 
5 functional nucleic acids include RNAi, antisense molecules, aptamers, ribozymes, triplex 

forming molecules, and external guide sequences. The functional nucleic acid molecules can act 
as affectors, inhibitors, modulators, and stimulators of a specific activity possessed by a target 
molecule, or the functional nucleic acid molecules can possess a de novo activity independent of 
any other molecules. 

10 86. Functional nucleic acid molecules can interact with any macromolecule, such as 

DNA, RNA, polypeptides, or carbohydrate chains. Thus, functional nucleic acids can interact 
with the mRNA of DHR96 or variants or fragments or the genomic DNA of DHR96 or variants 
or fragments or they can interact with the polypeptide DHR96 or variants or fragments. Often 
functional nucleic acids are designed to interact with other nucleic acids based on sequence 

15 homology between the target molecule and the functional nucleic acid molecule. In other 

situations, the specific recognition between the functional nucleic acid molecule and the target 
molecule is not based on sequence homology between the functional nucleic acid molecule and 
the target molecule, but rather is based on the formation of tertiary structure that allows specific 
recognition to take place. 

20 87. Disclosed are molecules that inhibit DHR96 activity that are based on RNA 

interference (RNAi) or small interfering RNA (SiRNA). It is thought that RNAi involves a two- 
step mechanism for RNA interference (RNAi): an initiation step and an effector step. For 
example, in the first step, input double- stranded (ds) RNA is processed into small fragments 
(siRNA), such as 2 1-23-nucleotide 'guide sequences'. RNA amplification appears to be able to 

25 occur in whole animals. Typically then, the guide RNAs can be incorporated into a protein RNA 
complex which is cable of degrading RNA, the nuclease complex, which has been called the 
RNA-induced silencing complex (RISC). This RISC complex acts in the second effector step to 
destroy mRNAs that are recognized by the guide RNAs through base-pairing interactions. RNAi 
involves the introduction by any means of double stranded RNA into the cell which triggers 

30 events that cause the degradation of a target RNA. RNAi is a form of post-transcriptional gene 
silencing. Disclosed are RNA hairpins that can act in RNAi. 

88. RNAi has been shown to work in a number of cells, including mammalian and 
invertebrate cells. In certain embodiements the RNA molecules which will be used as targeting 
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sequences within the RISC complex are shorter. For example, less than or equal to 50 or 40 or 
30 or 29, 28, 27, 26, 25, 24, 23, ,22, 2 1,20, 19, 18, 17, 16 , 15, 14, 1 3 , 12, 1 1, or 10 nucleotides 
in length. These RNA molecules can also have overhangs on the 3' or 5* ends relative to the 
target RNA which is to be cleaved. These overhangs can be at least or less than or equal to 1,2, 
5 3, 4, 5, 6, 7, 8, 9, 10, 1 5, or 20 nucleotides long. 

89. Methods of RNAi and SiRNA are described in detail in Hannon et al. (2002), RNA 
Interference, Nature 418:244-250; Brummelkamp et al. (2002), A System for Stable Expression 
of Short Interfering RNAs in Mammalian Cells, Science 296:550-508; Paul et al. (2002), 
Effective expression of small interfering RNA in human cells, Nature Biotechnology 20: 505- 

10 508, which are each incorporated by reference in their entirety for methods of RNAi and SiRNA 
and for designing and testing various oligos useful therein. 

90. RNA interference (RNAi) and gene targeting were used to disrupt DHR96 function 
because no existing mutants were available. The effects of DHR96 RNAi were analyzed by 
generating transgenic lines that express snapback RNA under the control of a heat-inducible 

15 promoter. Three independent lines showed strong reduction of DHR96 mRNA in northern blots 
when treated with a single heat-shock, but displayed no discernable phenotype. Using a variety 
of heat-shock regimens, e.g. longer single and double treatments or 12 hr repetitions, did not 
affect the outcome of this observation. These findings suggest that DHR96 mRNA is not 
necessary for viability under standard conditions, indicating either that DHR96 protein is very 

20 stable or dispensable for survival, and is consistent with the studies of DHR96 null mutants. 

91 . Antisense molecules are designed to interact with a target nucleic acid molecule 
through either canonical or non-canonical base pairing. The interaction of the antisense 
molecule and the target molecule is designed to promote the destruction of the target molecule 
through, for example, RNAseH mediated RNA-DNA hybrid degradation. Alternatively the 

25 antisense molecule is designed to interrupt a processing function that normally would take place 
on the target molecule, such as transcription or replication. Antisense molecules can be designed 
based on the sequence of the target molecule. Numerous methods for optimization of antisense 
efficiency by finding the most accessible regions of the target molecule exist. Exemplary 
methods would be in vitro selection experiments and DNA modification studies using DMS and 

30 DEPC. It is preferred that antisense molecules bind the target molecule with a dissociation 

constant (kd)less than or equal to 10" 6 , 10~ 8 , 10' 10 , or 10' 12 . A representative sample of methods 
and techniques which aid in the design and use of antisense molecules can be found in the 
following non-limiting list of United States patents: 5,135,917, 5,294,533, 5,627,158, 5,641,754, 
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5,691,317, 5,780,607, 5,786,138, 5,849,903, 5,856,103, 5,919,772, 5,955,590, 5,990,088, 
5,994,320, 5,998,602, 6,005,095, 6,007,995, 6,013,522, 6,017,898, 6,018,042, 6,025,198, 
6,033,910, 6,040,296, 6,046,004, 6,046,319, and 6,057,437. 

92. Aptarners are molecules that interact with a target molecule, preferably in a specific 
5 way. Typically aptarners are small nucleic acids ranging from 15-50 bases in length that fold 
into defined secondary and tertiary structures, such as stem-loops or G-quartets. Aptarners can 
bind small molecules, such as ATP (United States patent 5,631,146) and theophiline (United 
States patent 5,580,737), as well as large molecules, such as reverse transcriptase (United States 
patent 5,786,462) and thrombin (United States patent 5,543,293). Aptarners can bind very 

10 tightly with kdS from the target molecule of less than 10* !2 M. It is preferred that the aptarners 
bind the target molecule with a k<t less than I0 -6 , 10" 8 , 10" 10 , or 10' 12 . Aptarners can bind the 
target molecule with a very high degree of specificity. For example, aptarners have been isolated 
that have greater than a 10000 fold difference in binding affinities between the target molecule 
and another molecule that differ at only a single position on the molecule (United States patent 

15 5,543,293). It is preferred that the aptamer have a kd with the target molecule at least 10, 100, 
1000, 10,000, or 100,000 fold lower than the kd with a background binding molecule. It is 
preferred when doing the comparison for a polypeptide for example, that the background 
molecule be a different polypeptide. For example, when determining the specificity of aptarners 
to DHR96 protein or fragments or variants, the background protein could be serum albumin. 

20 Representative examples of how to make and use aptarners to bind a variety of different target 
molecules can be found in the following non-limiting list of United States patents: 5,476,766, 
5,503,978, 5,631,146, 5,731,424 , 5,780,228, 5,792,613, 5,795,721, 5,846,713, 5,858,660 , 
5,861,254, 5,864,026, 5,869,641, 5,958,691, 6,001,988, 6,01 1,020, 6,013,443, 6,020,130, 
6,028,186, 6,030,776, and 6,051,698. 

25 93. Ribozymes are nucleic acid molecules that are capable of catalyzing a chemical 

reaction, either intramolecularly or intermolecularly. Ribozymes are thus catalytic nucleic acid. 
It is preferred that the ribozymes catalyze intermolecular reactions. There are a number of 
different types of ribozymes that catalyze nuclease or nucleic acid polymerase type reactions 
which are based on ribozymes found in natural systems, such as hammerhead ribozymes, (for 

30 example, but not limited to the following United States patents: 5,334,71 1 , 5,436,330, 

5,616,466, 5,633,133, 5,646,020, 5,652,094, 5,712,384, 5,770,715, 5,856,463, 5,861,288, 
5,891,683, 5,891,684, 5,985,621, 5,989,908, 5,998,193, 5,998,203, WO 9858058 by Ludwig and 
Sproat, WO 9858057 by Ludwig and Sproat, and WO 9718312 by Ludwig and Sproat) hairpin 
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ribozymes (for example, but not limited to the following United States patents: 5,63 1,1 15, 
5,646,031, 5,683,902, 5,712,384, 5,856,188, 5,866,701, 5,869,339, and 6,022,962), and 
tetrahymena ribozymes (for example, but not limited to the following United States patents: 
5,595,873 and 5,652,107). There are also a number of ribozymes that are not found in natural 
5 systems, but which have been engineered to catalyze specific reactions de novo (for example, but 
not limited to the following United States patents: 5,580,967, 5,688,670, 5,807,718, and 
5,910,408). Preferred ribozymes cleave RNA or DNA substrates, and more preferably cleave 
RNA substrates. Ribozymes typically cleave nucleic acid substrates through recognition and 
binding of the target substrate with subsequent cleavage. This recognition is often based mostly 

10 on canonical or non-canonical base pair interactions. This property makes ribozymes 

particularly good candidates for target specific cleavage of nucleic acids because recognition of 
the target substrate is based on the target substrates sequence. Representative examples of how 
to make and use ribozymes to catalyze a variety of different reactions can be found in the 
following non-limiting list of United States patents: 5,646,042, 5,693,535, 5,731,295, 5,81 1,300, 

15 5,837,855, 5,869,253, 5,877,021, 5,877,022, 5,972,699, 5,972,704, 5,989,906, and 6,017,756. 

94. Triplex forming functional nucleic acid molecules are molecules that can interact 
with either double-stranded or single-stranded nucleic acid. When triplex molecules interact 
with a target region, a structure called a triplex is formed, in which there are three strands of 
DNA forming a complex dependant on both Watson-Crick and Hoogsteen base-pairing. Triplex 

20 molecules are preferred because they can bind target regions with high affinity and specificity. It 
is preferred that the triplex forming molecules bind the target molecule with a k^ less than 10" 6 , 
10~*, 10" 10 , or 10" 12 . Representative examples of how to make and use triplex forming molecules 
to bind a variety of different target molecules can be found in the following non-limiting list of 
United States patents: 5,176,996, 5,645,985, 5,650,316, 5,683,874, 5,693,773, 5,834,185, 

25 5,869,246, 5,874,566, and 5,962,426. 

95. External guide sequences (EGSs) are molecules that bind a target nucleic acid 
molecule forming a complex, and this complex is recognized by RNase P, which cleaves the 
target molecule. EGSs can be designed to specifically target a RNA molecule of choice. RNAse 
P aids in processing transfer RNA (tRNA) within a cell. Bacterial RNAse P can be recruited to 

30 cleave virtually any RNA sequence by using an EGS that causes the target RNA:EGS complex to 
mimic the natural tRNA substrate. (WO 92/03566 by Yale, and Forster and Airman, Science 
238:407-409(1990)). 
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96. Similarly, eukaryotic EGS/RNAse P-directed cleavage of RNA can be utilized to 
cleave desired targets within eukarotic cells. (Yuan et al., Proc. Natl. Acad. Sci. USA 
89:8006-8010 (1992); WO 93/22434 by Yale; WO 95/24489 by Yale; Yuan and Altaian, EMBO 
J 14:159-168 (1995). and Carrara etal., Proc. Natl. Acad. Sci. ttJSA^ 92:2627-2631 (1995)). 

5 Representative examples of how to make and use EGS molecules to facilitate cleavage of a 

variety of different target molecules be found in the following non-limiting list of United States 
patents: 5,168,053, 5,624,824, 5,683,873, 5,728,521, 5,869,248, and 5,877,162. 

(b) Antibodies 

97. Disclosed are monoclonal and polyclonal as well as chimeric variants of these, that 
10 bind DHR96 or variants or fragments thereof. Also disclosed are monoclonal and polyclonal 

antibodies that bind DHR96 or variants or fragments thereof that inhibit DHR96 activity in, for 
example, the xenobiotic pathways disclosed herein. Various assays are disclosed herein that can 
be used to identify these antibodies, such as the nutritional viability assay disclosed herein or the 
sensitivity to toxins assay disclosed herein. 

15 98. As used herein, the term "antibody" encompasses, but is not limited to, whole 

immunoglobulin (i.e., an intact antibody) of any class. Native antibodies are usually 
heterotetrameric glycoproteins, composed of two identical light (L) chains and two identical 
heavy (H) chains. Typically, each light chain is linked to a heavy chain by one covalent disulfide 
bond, while the number of disulfide linkages varies between the heavy chains of different 

20 immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intrachain 
disulfide bridges. Each heavy chain has at one end a variable domain (V(H)) followed by a 
number of constant domains. Each light chain has a variable domain at one end (V(L)) and a 
constant domain at its other end; the constant domain of the light chain is aligned with the first 
constant domain of the heavy chain, and the light chain variable domain is aligned with the 

25 variable domain of the heavy chain. Particular amino acid residues are believed to form an 
interface between the light and heavy chain variable domains. The light chains of antibodies 
from any vertebrate species can be assigned to one of two clearly distinct types, called kappa (k) 
and lambda (1), based on the amino acid sequences of their constant domains. Depending on the 
amino acid sequence of the constant domain of their heavy chains, immunoglobulins can be 

30 assigned to different classes. There are five major classes of human immunoglobulins: IgA, IgD, 
IgE, IgG and IgM, and several of these may be further divided into subclasses (isotypes), e.g., 
IgG-1, IgG-2, IgG-3, and IgG-4; IgA-1 and IgA-2. One skilled in the art would recognize the 
comparable classes for mouse. The heavy chain constant domains that correspond to the 
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different classes of immunoglobulins are called alpha, delta, epsilon, gamma, and mu, 
respectively. 

99. The term "variable" is used herein to describe certain portions of the variable 
domains that differ in sequence among antibodies and are used in the binding and specificity of 

5 each particular antibody for its particular antigen. However, the variability is not usually evenly 
distributed through the variable domains of antibodies. It is typically concentrated in three 
segments called complementarity determining regions (CDRs) or hypervariable regions both in 
the light chain and the heavy chain variable domains. The more highly conserved portions of the 
variable domains are called the framework (FR). The variable domains of native heavy and light 

10 chains each comprise four FR regions, largely adopting a b-sheet configuration, connected by 
three CDRs, which form loops connecting, and in some cases forming part of, the b-sheet 
structure. The CDRs in each chain are held together in close proximity by the FR regions and, 
with the CDRs from the other chain, contribute to the formation of the antigen binding site of 
antibodies (see Kabat E. A. et al., "Sequences of Proteins of Immunological Interest," National 

15 Institutes of Health, Bethesda, Md. (1987)). The constant domains are not involved directly in 
binding an antibody to an antigen, but exhibit various effector functions, such as participation of 
the antibody in antibody-dependent cellular toxicity. 

100. As used herein, the term "antibody or fragments thereof* encompasses chimeric 
antibodies and hybrid antibodies, with dual or multiple antigen or epitope specificities, and 

20 fragments, such as F(ab')2, Fab', Fab and the like, including hybrid fragments. Thus, fragments 
of the antibodies that retain the ability to bind their specific antigens are provided. For example, 
fragments of antibodies which maintain binding activity to the DHR96 or variants or fragments 
thereof are included within the meaning of the term "antibody or fragment thereof." Such 
antibodies and fragments can be made by techniques known in the art and can be screened for 

25 specificity and activity according to the methods set forth in the Examples and in general 
methods for producing antibodies and screening antibodies for specificity and activity (See 
Harlow and Lane. Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New 
York, (1988)). 

101 . Also included within the meaning of "antibody or fragments thereof are 
30 conjugates of antibody fragments and antigen binding proteins (single chain antibodies) as 

described, for example, in U.S. Pat. No. 4,704,692, the contents of which are hereby 
incorporated by reference. 
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102. Optionally, the antibodies are generated in other species and "humanized" for 
administration in humans. Humanized forms of non-human (e.g., murine) antibodies are 
chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', 
F(ab')2, or other antigen-binding subsequences of antibodies) which contain minimal sequence 

5 derived from non-human immunoglobulin. Humanized antibodies include human 

immunoglobulins (recipient antibody) in which residues from a complementary determining 
region (CDR) of the recipient are replaced by residues from a CDR of a non-human species 
(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. 
In some instances, Fv framework residues of the human immunoglobulin are replaced by 

1 0 corresponding non-human residues. Humanized antibodies may also comprise residues that are 
found neither in the recipient antibody nor in the imported CDR or framework sequences. In 
general, the humanized antibody will comprise substantially all of at least one, and typically two, 
variable domains, in which all or substantially all of the CDR regions correspond to those of a 
non-human immunoglobulin and all or substantially all of the FR regions are those of a human 

15 immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al., Nature, 321 :522-525 (1986); Riechmann et al., Nature, 332:323- 
327 (1988); and Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)). 

103. Methods for humanizing non-human antibodies are well known in the art. 

20 Generally, a humanized antibody has one or more amino acid residues introduced into it from a 
source that is non-human. These non-human amino acid residues are often referred to as 
"import" residues, which are typically taken from an "import" variable domain. Hurnanization 
can be essentially performed following the method of Winter and co-workers (Jones et al., 
Nature, 321 :522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., 

25 Science, 239: 1 534-1 536 (1988)), by substituting rodent CDRs or CDR sequences for the 

corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are 
chimeric antibodies (U.S. Pat. No. 4,816,567), wherein substantially less than an intact human 
variable domain has been substituted by the corresponding sequence from a non-human species. 
In practice, humanized antibodies are typically human antibodies in which some CDR residues 

30 and possibly some FR residues are substituted by residues from analogous sites in rodent 
antibodies. 

104. The choice of human variable domains, both light and heavy, to be used in 
making the humanized antibodies is very important in order to reduce antigenicity. According to 
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the "best-fit" method, the sequence of the variable domain of a rodent antibody is screened 
against the entire library of known human variable domain sequences. The human sequence 
which is closest to that of the rodent is then accepted as the human framework (FR) for the 
humanized antibody (Sims et al., J. Immunol., 151 :2296 (1993) and Chothia et al., J. Mol. 
5 Biol., 196:901 (1987)). Another method uses a particular framework derived from the consensus 
sequence of all human antibodies of a particular subgroup of light or heavy chains. The same 
framework may be used for several different humanized antibodies (Carter et al., Proc. Natl. 
Acad. Sci. USA, 89:4285 (1992); Presta et al., J. Immunol., 151:2623 (1993)). 

105. It is further important that antibodies be humanized with retention of high affinity 
10 for the antigen and other favorable biological properties. To achieve this goal, according to a 

preferred method, humanized antibodies are prepared by a process of analysis of the parental 
sequences and various conceptual humanized products using three dimensional models of the 
parental and humanized sequences. Three dimensional immunoglobulin models are commonly 
available and are familiar to those skilled in the art. Computer programs are available which 

15 illustrate and display probable three-dimensional conformational structures of selected candidate 
immunoglobulin sequences. Inspection of these displays permits analysis of the likely role of the 
residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis of 
residues that influence the ability of the candidate immunoglobulin to bind its antigen. In this 
way, FR residues can be selected and combined from the consensus and import sequence so that 

20 the desired antibody characteristic, such as increased affinity for the target antigen(s), is 
achieved. In general, the CDR residues are directly and most substantially involved in 
influencing antigen binding (see, WO 94/04679, published 3 March 1994). 

106. Transgenic animals (e.g., mice) that are capable, upon immunization, of 
producing a full repertoire of human antibodies in the absence of endogenous immunoglobulin 

25 production can be employed. For example, it has been described that the homozygous deletion 
of the antibody heavy chain joining region (J(H)) gene in chimeric and germ-line mutant mice 
results in complete inhibition of endogenous antibody production. Transfer of the human germ- 
line immunoglobulin gene array in such germ-line mutant mice will result in the production of 
human antibodies upon antigen challenge (see, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. 

30 USA, 90:2551-255 (1993); Jakobovits et al., Nature, 362:255-258 (1993); Bruggemann et al., 
Year in Immuno., 7:33 (1993)). Human antibodies can also be produced in phage display 
libraries (Hoogenboom et al., J. Mol. Biol., 227:381 (1991); Marks et al., J. Mol. Biol., 
222:581 (1 99 1 )). The techniques of Cote et al. and Boemer et al. are also available for the 
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preparation of human monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, p. 77 (1985); Boerner et al., J. Immunol., 147(1 ):86-95 (1991)). 

107. Disclosed are hybidoma cells that produces the monoclonal antibody. The term 
"monoclonal antibody" as used herein refers to an antibody obtained from a substantially 
5 homogeneous population of antibodies, i.e., the individual antibodies comprising the population 
are identical except for possible naturally occurring mutations that may be present in minor 
amounts. The monoclonal antibodies herein specifically include "chimeric" antibodies in which 
a portion of the heavy and/or light chain is identical with or homologous to corresponding 
sequences in antibodies derived from a particular species or belonging to a particular antibody 

1 0 class or subclass, while the remainder of the chain(s) is identical with or homologous to 

corresponding sequences in antibodies derived from another species or belonging to another 
antibody class or subclass, as well as fragments of such antibodies, so long as they exhibit the 
desired activity (See, U.S. Pat. No. 4,816,567 and Morrison et al., Proc. Natl. Acad. Sci. 
USA, 81:6851-6855 (1984)). 

15 108. Monoclonal antibodies may be prepared using hybridoma methods, such as those 

described by Kohler and Milstein, Nature, 256:495 (1975) or Harlow and Lane. Antibodies, A 
Laboratory Manual. Cold Spring Harbor Publications, New York, (1988). In a hybridoma 
method, a mouse or other appropriate host animal, is typically immunized with an immunizing 
agent to elicit lymphocytes that produce or are capable of producing antibodies that will 

20 specifically bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in 
vitro. Preferably, the immunizing agent comprises DHR96 or variants or fragments thereof. 
Traditionally, the generation of monoclonal antibodies has depended on the availability of 
purified protein or peptides for use as the immunogen. More recently DNA based 
immunizations have shown promise as a way to elicit strong immune responses and generate 

25 monoclonal antibodies. In this approach, DNA-based immunization can be used, wherein DNA 
encoding a portion of DHR96 or variants or fragments thereof expressed as a fusion protein with 
human IgGl is injected into the host animal according to methods known in the art (e.g., 
Kilpatrick KE, et al. Gene gun delivered DNA-based immunizations mediate rapid production 
of murine monoclonal antibodies to the Flt-3 receptor. Hybridoma. 1998 Dec; 1 7(6):569-76; 

30 Kilpatrick KE et al. High-affinity monoclonal antibodies to PED/PEA-1 5 generated using 5 
microg of DNA. Hybridoma. 2000 Aug;19(4):297-302, which are incorporated herein by 
referenced in full for the the methods of antibody production) and as described in the examples. 
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1 09. An alternate approach to immunizations with either purified protein or DNA is to 
use antigen expressed in baculovirus. The advantages to this system include ease of generation, 
high levels of expression, and post-translational modifications that are highly similar to those 
seen in mammalian systems. Use of this system involves expressing domains of antibodies to 
5 DHR96 or variants or fragments thereof as fusion proteins. The antigen is produced by inserting 
a gene fragment in-frame between the signal sequence and the mature protein domain of the 
antibodies to DHR96 or variants or fragments thereof nucleotide sequence. This results in the 
display of the foreign proteins on the surface of the virion. This method allows immunization 
with whole virus, eliminating the need for purification of target antigens. 

10 110. Generally, either peripheral blood lymphocytes ("PBLs") are used in methods of 

producing monoclonal antibodies if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to 
form a hybridoma cell (Goding, "Monoclonal Antibodies: Principles and Practice" Academic 

15 Press, (1 986) pp. 59-103). Immortalized cell lines are usually transformed mammalian cells, 
including myeloma cells of rodent, bovine, equine, and human origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
medium that preferably contains one or more substances that inhibit the growth or survival of the 
unfiised, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine 

20 guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. Preferred immortalized cell lines are 
those that fuse efficiently, support stable high level expression of antibody by the selected 
antibody-producing cells, and are sensitive to a medium such as HAT medium. More preferred 

25 immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the 
Salk Institute Cell Distribution Center, San Diego, Calif, and the American Type Culture 
Collection, Rockville, Md. Human myeloma and mouse-human heteromyeloma cell lines also 
have been described for the production of human monoclonal antibodies (Kozbor, J. Immunol., 
133:3001 (1984); Brodeur et al., "Monoclonal Antibody Production Techniques and 

30 Applications" Marcel Dekker, Inc., New York, (1987) pp. 51-63). The culture medium in which 
the hybridoma cells are cultured can then be assayed for the presence of monoclonal antibodies 
directed against DHR96 or variants or fragments thereof. Preferably, the binding specificity of 
monoclonal antibodies produced by the hybridoma cells is determined by immunoprecipitation 
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or by an in vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked 
immunoabsorbent assay (ELISA). Such techniques and assays are known in the art, and are 
described further in the Examples below or in Harlow and Lane "Antibodies, A Laboratory 
Manual" Cold Spring Harbor Publications, New York, (1988). 
5 111. After the desired hybridoma cells are identified, the clones may be subcloned by 

limiting dilution or FACS sorting procedures and grown by standard methods. Suitable culture 
media for this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI- 
1640 medium. Alternatively, the hybridoma cells may be grown in vivo as ascites in a mammal. 

1 12. The monoclonal antibodies secreted by the subclones may be isolated or purified 
10 from the culture medium or ascites fluid by conventional immunoglobulin purification 

procedures such as, for example, protein A-Sepharose, protein G, hydroxylapatite 
chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

113. The monoclonal antibodies may also be made by recombinant DNA methods, 
such as those described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal antibodies 

1 5 can be readily isolated and sequenced using conventional procedures (e.g., by using 

oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells serve as a preferred source of such 
DNA. Once isolated, the DNA may be placed into expression vectors, which are then 
transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 

20 plasmacytoma cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to 
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also may 
be modified, for example, by substituting the coding sequence for human heavy and light chain 
constant domains in place of the homologous murine sequences (U.S. Pat. No. 4,816,567) or 
by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence 

25 for a non-immunoglobulin polypeptide. Optionally, such a non-immunoglobulin polypeptide is 
substituted for the constant domains of an antibody or substituted for the variable domains of one 
antigen-combining site of an antibody to create a chimeric bivalent antibody comprising one 
antigen-combining site having specificity for DHR96 or variants or fragments thereof and 
another antigen-combining site having specificity for a different antigen. 

30 114. In vitro methods are also suitable for preparing monovalent antibodies. Digestion 

of antibodies to produce fragments thereof, particularly, Fab fragments, can be accomplished 
using routine techniques known in the art. For instance, digestion can be performed using 
papain. Examples of papain digestion are described in WO 94/29348 published Dec. 22, 1 994, 
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U.S. Pat. No. 4,342,566, and Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring 
Harbor Publications, New York, (1988). Papain digestion of antibodies typically produces two 
identical antigen binding fragments, called Fab fragments, each with a single antigen binding 
site, and a residual Fc fragment. Pepsin treatment yields a fragment, called the F(ab*)2 fragment, 
5 that has two antigen combining sites and is still capable of cross-linking antigen. 

115. The Fab fragments produced in the antibody digestion also contain the constant 
domains of the light chain and the first constant domain of the heavy chain. Fab 1 fragments 
differ from Fab fragments by the addition of a few residues at the carboxy terminus of the heavy 
chain domain including one or more cysteines from the antibody hinge region. The F(ab')2 

10 fragment is a bivalent fragment comprising two Fab' fragments linked by a disulfide bridge at 
the hinge region. Fab'-SH is the designation herein for Fab* in which the cysteine residue(s) of 
the constant domains bear a free thiol group. Antibody fragments originally were produced as 
pairs of Fab' fragments which have hinge cysteines between them. Other chemical couplings of 
antibody fragments are also known. 

15 1 1 6. An isolated imrnunogenically specific paratope or fragment of the antibody is also 

provided. A specific immunogenic epitope of the antibody can be isolated from the whole 
antibody by chemical or mechanical disruption of the molecule. The purified fragments thus 
obtained are tested to determine their immunogenicity and specificity by the methods taught 
herein. Immunoreactive paratopes of the antibody, optionally, are synthesized directly. An 

20 immunoreacrive fragment is defined as an amino acid sequence of at least about two to five 
consecutive amino acids derived from the antibody amino acid sequence. 

1 17. One method of producing proteins comprising the antibodies is to link two or 
more peptides or polypeptides together by protein chemistry techniques. For example, peptides 
or polypeptides can be chemically synthesized using currently available laboratory equipment 

25 using either Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert -butyloxycarbonoyl) chemistry. 
(Applied Biosystems, Inc., Foster City, CA). One skilled in the art can readily appreciate that a 
peptide or polypeptide corresponding to the antibody, for example, can be synthesized by 
standard chemical reactions. For example, a peptide or polypeptide can be synthesized and not 
cleaved from its synthesis resin whereas the other fragment of an antibody can be synthesized 

30 and subsequently cleaved from the resin, thereby exposing a terminal group which is functionally 
blocked on the other fragment. By peptide condensation reactions, these two fragments can be 
co valently joined via a peptide bond at their carboxyl and amino termini, respectively, to form an 
antibody, or fragment thereof. (Grant GA (1992) Synthetic Peptides: A User Guide. W.H. 



— 33 — 



Aiiomcy Docket Number 2 1 1 0 1. 0053 U I 

Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed. (1993) Principles of Peptide 
Synthesis. Springer- Verlag Inc., NY. Alternatively, the peptide or polypeptide is independently 
synthesized in vivo as described above. Once isolated, these independent peptides or 
polypeptides may be linked to form an antibody or fragment thereof via similar peptide 
5 condensation reactions. 

1 1 8. For example, enzymatic ligation of cloned or synthetic peptide segments allow 
relatively short peptide fragments to be joined to produce larger peptide fragments, polypeptides 
or whole protein domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively, 
native chemical ligation of synthetic peptides can be utilized to synthetically construct large 

10 peptides or polypeptides from shorter peptide fragments. This method consists of a two step 
chemical reaction (Dawson et al. Synthesis of Proteins by Native Chemical Ligation. Science, 
266:776-779 (1994)). The first step is the chemoselective reaction of an unprotected synthetic 
peptide-alpha-thioester with another unprotected peptide segment containing an amino-terminal 
Cys residue to give a thioester-linked intermediate as the initial covalent product. Without a 

15 change in the reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular 
reaction to form a native peptide bond at the ligation site. Application of this native chemical 
ligation method to the total synthesis of a protein molecule is illustrated by the preparation of 
human interleukin 8 (IL-8) (Baggiolini M et al. (1992) FEBS Lett. 307:97-101 ; Clark-Lewis I et 
al., J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al., Biochemistry, 30:3128 (1991); 

20 Rajarathnam K et al., Biochemistry 33:6623-30 (1994)). 

1 19. Alternatively, unprotected peptide segments are chemically linked where the bond 
formed between the peptide segments as a result of the chemical ligation is an unnatural (non- 
peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)). This technique has been used to 
synthesize analogs of protein domains as well as large amounts of relatively pure proteins with 

25 full biological activity (deLisle Milton RC et al., Techniques in Protein Chemistry IV. Academic 
Press, New York, pp. 257-267 ( 1 992)). 

1 20. Also disclosed are fragments of antibodies which have bioactivity. The 
polypeptide fragments can be recombinant proteins obtained by cloning nucleic acids encoding 
the polypeptide in an expression system capable of producing the polypeptide fragments thereof, 

30 such as an adenovirus or baculovirus expression system. For example, one can determine the 
active domain of an antibody from a specific hybridoma that can cause a biological effect 
associated with the interaction of the antibody with DHR96 or variants or fragments thereof. For 
example, amino acids found to not contribute to either the activity or the binding specificity or 
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affinity of the antibody can be deleted without a loss in the respective activity. For example, in 
various embodiments, amino or carboxy-terminal amino acids are sequentially removed from 
either the native or the modified non-immunoglobulin molecule or the immunoglobulin molecule 
and the respective activity assayed in one of many available assays. In another example, a 
5 fragment of an antibody comprises a modified antibody wherein at least one amino acid has been 
substituted for the naturally occurring amino acid at a specific position, and a portion of either 
amino terminal or carboxy terminal amino acids, or even an internal region of the antibody, has 
been replaced with a polypeptide fragment or other moiety, such as biotin, which can facilitate in 
the purification of the modified antibody. For example, a modified antibody can be fused to a 

1 0 maltose binding protein, through either peptide chemistry or cloning the respective nucleic acids 
encoding the two polypeptide fragments into an expression vector such that the expression of the 
coding region results in a hybrid polypeptide. The hybrid polypeptide can be affinity purified by 
passing it over an amylose affinity column, and the modified antibody receptor can then be 
separated from the maltose binding region by cleaving the hybrid polypeptide with the specific 

15 protease factor Xa. (See, for example, New England Biolabs Product Catalog, 1996, pg. 164.). 
Similar purification procedures are available for isolating hybrid proteins from eukaryotic cells 
as well. 

121. The fragments, whether attached to other sequences or not, include insertions, 
deletions, substitutions, or other selected modifications of particular regions or specific amino 

20 acids residues, provided the activity of the fragment is not significantly altered or impaired 

compared to the nonmodified antibody or antibody fragment. These modifications can provide 
for some additional property, such as to remove or add amino acids capable of disulfide bonding, 
to increase its bio-longevity, to alter its secretory characteristics, etc. In any case, the fragment 
must possess a bioactive property, such as binding activity, regulation of binding at the binding 

25 domain, etc. Functional or active regions of the antibody may be identified by mutagenesis of a 
specific region of the protein, followed by expression and testing of the expressed polypeptide. 
Such methods are readily apparent to a skilled practitioner in the art and can include site-specific 
mutagenesis of the nucleic acid encoding the antigen. (Zoller MJ et al. Nucl. Acids Res. 
10:6487-500(1982). 

30 1 22. A variety of immunoassay formats may be used to select antibodies that 

selectively bind with a particular protein, variant, or fragment. For example, solid-phase ELISA 
immunoassays are routinely used to select antibodies selectively immunoreactive with a protein, 
protein variant, or fragment thereof. See Harlow and Lane. Antibodies, A Laboratory Manual. 



— 35 — 



Auorncy Docket Number 21 101. 0053 U I 

Cold Spring Harbor Publications, New York, (1988), for a description of immunoassay formats 
and conditions that could be used to determine selective binding. The binding affinity of a 
monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson et 
al., Anal. Biochem., 107:220(1980). 
5 123. Also provided is an antibody reagent kit comprising containers of the monoclonal 

antibody or fragment thereof and one or more reagents for detecting binding of the antibody or 
fragment thereof to DHR96 or variants or fragments thereof. The reagents can include, for 
example, fluorescent tags, enzymatic tags, or other tags. The reagents can also include secondary 
or tertiary antibodies or reagents for enzymatic reactions, wherein the enzymatic reactions 
1 0 produce a product that can be visualized. 

(c) Compositions identified by screening with disclosed 
compositions / com bin atorial chem istry 
(i) Combinatorial chemistry 

1 24. The disclosed compositions can be used as targets for any combinatorial 

1 5 technique to identify molecules or macromolecular molecules that interact with the disclosed 
compositions in a desired way. The nucleic acids, peptides, and related molecules disclosed 
herein, such as DHR96 or variants or fragments thereof, can be used as targets for the 
combinatorial approaches. Also disclosed are the compositions that are identified through 
combinatorial techniques or screening techniques in which the compositions, such as DHR96 or 

20 variants or fragments thereof, or portions thereof, are used as the target in a combinatorial or 
screening protocol. 

125. It is understood that when using the disclosed compositions in combinatorial 
techniques or screening methods, molecules, such as macromolecular molecules, will be 
identified that have particular desired properties such as inhibition or stimulation or the target 

25 molecule's function. The molecules identified and isolated when using the disclosed 

compositions, such as, DHR96 or variants or fragments thereof, are also disclosed. Thus, the 
products produced using the combinatorial or screening approaches that involve the disclosed 
compositions, such as, DHR96 or variants or fragments thereof, are also considered herein 
disclosed. 

30 126. It is understood that the disclosed methods for identifying molecules that inhibit 

the interactions between, for example, DHR96 or variants or fragments thereof, can be 
performed using high through put means. For example, putative inhibitors can be identified 
using Fluorescence Resonance Energy Transfer (FRET) to quickly identify interactions. The 
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underlying theory of the techniques is that when two molecules are close in space, ie, interacting 
at a level beyond background, a signal is produced or a signal can be quenched. Then, a variety 
of experiments can be performed, including, for example, adding in a putative inhibitor. If the 
inhibitor competes with the interaction between the two signaling molecules, the signals will be 
5 removed from each other in space, and this will cause a decrease or an increase in the signal, 
depending on the type of signal used. This decrease or increasing signal can be correlated to the 
presence or absence of the putative inhibitor. Any signaling means can be used. For example, 
disclosed are methods of identifying an inhibitor of the interaction between any two of the 
disclosed molecules comprising, contacting a first molecule and a second molecule together in 

10 the presence of a putative inhibitor, wherein the first molecule or second molecule comprises a 
fluorescence donor, wherein the first or second molecule, typically the molecule not comprising 
the donor, comprises a fluorescence acceptor; and measuring Fluorescence Resonance Energy 
Transfer (FRET), in the presence of the putative inhibitor and the in absence of the putative 
inhibitor, wherein a decrease in FRET in the presence of the putative inhibitor as compared to 

1 5 FRET measurement in its absence indicates the putative inhibitor inhibits binding between the 
two molecules. This type of method can be performed with a cell system as well. 

127. Combinatorial chemistry includes but is not limited to all methods for isolating 
small molecules or macromolecules that are capable of binding either a small molecule or 
another macromolecule, typically in an iterative process. Proteins, oligonucleotides, and sugars 

20 are examples of macromolecules. For example, oligonucleotide molecules with a given 
function, catalytic or ligand-binding, can be isolated from a complex mixture of random 
oligonucleotides in what has been referred to as "in vitro genetics" (Szostak, TIBS 19:89, 1992). 
One synthesizes a large pool of molecules bearing random and defined sequences and subjects 
that complex mixture, for example, approximately 10 ,s individual sequences in 100 ng of a 100 

25 nucleotide RNA, to some selection and enrichment process. Through repeated cycles of affinity 
chromatography and PCR amplification of the molecules bound to the ligand on the column, 
Ellington and Szostak (1990) estimated that 1 in 10 10 RNA molecules folded in such a way as to 
bind a small molecule dyes. DNA molecules with such ligand-binding behavior have been 
isolated as well (Ellington and Szostak, 1992; Bock et al, 1992). Techniques aimed at similar 

30 goals exist for small organic molecules, proteins, antibodies and other macromolecules known to 
those of skill in the art. Screening sets of molecules for a desired activity whether based on 
small organic libraries, oligonucleotides, or antibodies is broadly referred to as combinatorial 
chemistry. Combinatorial techniques are particularly suited for defining binding interactions 



Attorney Docket Number 2 1 101. 0053 U I 

between molecules and for isolating molecules that have a specific binding activity, often called 
aptamers when the macromolecules are nucleic acids. 

128. There are a number of methods for isolating proteins which either have de novo 
activity or a modified activity. For example, phage display libraries have been used to isolate 

5 numerous peptides that interact with a specific target. (See for example, United States Patent 
No. 6,03 1 ,07 1 ; 5,824,520; 5,596,079; and 5,565,332 which are herein incorporated by reference 
at least for their material related to phage display and methods relate to combinatorial chemistry) 

129. A preferred method for isolating proteins that have a given function is described 
by Roberts and Szostak (Roberts R.W. and Szostak J.W. Proc. Natl. Acad. Sci. USA, 

10 94(23)12997-302 (1997). This combinatorial chemistry method couples the functional power of 
proteins and the genetic power of nucleic acids. An RNA molecule is generated in which a 
puromycin molecule is covalently attached to the 3'-end of the RNA molecule. An in vitro 
translation of this modified RNA molecule causes the correct protein, encoded by the RNA to be 
translated. In addition, because of the attachment of the puromycin, a peptdyl acceptor which 

15 cannot be extended, the growing peptide chain is attached to the puromycin which is attached to 
the RNA. Thus, the protein molecule is attached to the genetic material that encodes it. Normal 
in vitro selection procedures can now be done to isolate functional peptides. Once the selection 
procedure for peptide function is complete traditional nucleic acid manipulation procedures are 
performed to amplify the nucleic acid that codes for the selected functional peptides. After 

20 amplification of the genetic material, new RNA is transcribed with puromycin at the 3'-end, new 
peptide is translated and another functional round of selection is performed. Thus, protein 
selection can be performed in an iterative manner just like nucleic acid selection techniques. The 
peptide which is translated is controlled by the sequence of the RNA attached to the puromycin. 
This sequence can be anything from a random sequence engineered for optimum translation (i.e. 

25 no stop codons etc.) or it can be a degenerate sequence of a known RNA molecule to look for 
improved or altered function of a known peptide. The conditions for nucleic acid amplification 
and in vitro translation are well known to those of ordinary skill in the art and are preferably 
performed as in Roberts and Szostak (Roberts R.W. and Szostak J.W. Proc. Natl. Acad. Sci. 
USA, 94(23)12997-302 (1997)). 

30 130. Another preferred method for combinatorial methods designed to isolate peptides 

is described in Cohen et al. (Cohen B.A.,et al., Proc. Natl. Acad. Sci. USA 95(24): 14272-7 
(1998)). This method utilizes and modifies two-hybrid technology. Yeast two-hybrid systems 
are useful for the detection and analysis of proteimprotein interactions. The two-hybrid system, 
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initially described in the yeast Saccharomyces cerevisiae, is a powerful molecular genetic 
technique for identifying new regulatory molecules, specific to the protein of interest (Fields and 
Song, Nature 340:245-6 (1989)). Cohen et al., modified this technology so that novel 
interactions between synthetic or engineered peptide sequences could be identified which bind a 

5 molecule of choice. The benefit of this type of technology is that the selection is done in an 
intracellular environment. The method utilizes a library of peptide molecules that attached to an 
acidic activation domain. A peptide of choice, for example, of DHR96 or variants or fragments 
thereof, is attached to a DNA binding domain of a transcriptional activation protein, such as Gal 
4. By performing the two-hybrid technique on this type of system, molecules that bind DHR96 

10 or variants or fragments thereof can be identified. 

131. Using methodology well known to those of skill in the art, in combination with 
various combinatorial libraries, one can isolate and characterize those small molecules or 
macromolecules, which bind to or interact with the desired target. The relative binding affinity 
of these compounds can be compared and optimum compounds identified using competitive 

15 binding studies, which are well known to those of skill in the art. 

132. Techniques for making combinatorial libraries and screening combinatorial 
libraries to isolate molecules which bind a desired target are well known to those of skill in the 
art. Representative techniques and methods can be found in but are not limited to United States 
patents 5,084,824, 5,288,514, 5,449,754, 5,506,337, 5,539,083, 5,545,568, 5,556,762, 5,565,324, 

20 5,565,332, 5,573,905, 5,618,825, 5,619,680, 5,627,210, 5,646,285, 5,663,046, 5,670,326, 
5,677,195, 5,683,899, 5,688,696, 5,688,997, 5,698,685, 5,712,146, 5,721,099, 5,723,598, 
5,741,713, 5,792,431, 5,807,683, 5,807,754, 5,821,130, 5,831,014, 5,834,195, 5,834,318, 
5,834,588, 5,840,500, 5,847,150, 5,856,107, 5,856,496, 5,859,190, 5,864,010, 5,874,443, 
5,877,214, 5,880,972, 5,886,126, 5,886,127, 5,891,737, 5,916,899, 5,919,955, 5,925,527, 

25 5,939,268, 5,942,387, 5,945,070, 5,948,696, 5,958,702, 5,958,792, 5,962,337, 5,965,719, 
5,972,719, 5,976,894, 5,980,704, 5,985,356, 5,999,086, 6,001,579, 6,004,617, 6,008,321, 
6,017,768, 6,025,371, 6,030,917, 6,040,193, 6,045,671, 6,045,755, 6,060,596, and 6,061,636. 

1 33. Combinatorial libraries can be made from a wide array of molecules using a 
number of different synthetic techniques. For example, libraries containing fused 2,4- 

30 pyrimidinediones (United States patent 6,025,37 1 ) dihydrobenzopyrans (United States Patent 
6,017,768and 5,821,130), amide alcohols (United States Patent 5,976,894), hydroxy-amino acid 
amides (United States Patent 5,972,719) carbohydrates (United States patent 5,965,719), 1,4- 
benzodiazepin-2,5-diones (United States patent 5,962,337), cyclics (United States patent 
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5,958,792), biaryl amino acid amides (United States patent 5,948,696), thiophenes (United States 
patent 5,942,387), tricyclic Tetrahydroquinolines (United States patent 5,925,527), benzofurans 
(United States patent 5,919,955), isoquinolines (United States patent 5,916,899), hydantoin and 
thiohydantoin (United States patent 5,859,190), indoles (United States patent 5,856,496), 
5 imidazol-pyrido-indole and imidazol-pyrido-benzothiophenes (United States patent 5,856,107) 
substituted 2-methylene-2, 3-dihydrothiazoles (United States patent 5,847,150), quinolines 
(United States patent 5,840,500), PNA (United States patent 5,831,014), containing tags (United 
States patent 5,721 ,099), polyketides (United States patent 5,712,146), morpholino-subunits 
(United States patent 5,698,685 and 5,506,337), sulfamides (United States patent 5,618,825), and 
10 benzodiazepines (United States patent 5,288,514). 

134. As used herein combinatorial methods and libraries included traditional screening 
methods and libraries as well as methods and libraries used in interative processes. 

(ii) Computer assisted drug design 

135. The disclosed compositions can be used as targets for any molecular modeling 

1 5 technique to identify either the structure of the disclosed compositions or to identify potential or 
actual molecules, such as small molecules, which interact in a desired way with the disclosed 
compositions. The nucleic acids, peptides, and related molecules disclosed herein, such as 
DHR96 or variants or fragments thereof, can be used as targets in any molecular modeling 
program or approach. 

20 1 36. It is understood that when using the disclosed compositions in modeling 

techniques, molecules, such as macromolecular molecules, will be identified that have particular 
desired properties such as inhibition or stimulation or the target molecule's function. The 
molecules identified and isolated when using the disclosed compositions, such as, DHR96 or 
variants or fragments thereof, are also disclosed. Thus, the products produced using the 

25 molecular modeling approaches that involve the disclosed compositions, such as, DHR96 or 
variants or fragments thereof, are also considered herein disclosed. 

137. Thus, one way to isolate molecules that bind a molecule of choice is through 
rational design. This is achieved through structural information and computer modeling. 
Computer modeling technology allows visualization of the three-dimensional atomic structure of 

30 a selected molecule and the rational design of new compounds that will interact with the 

molecule. The three-dimensional construct typically depends on data from x-ray crystallographic 
analyses or NMR imaging of the selected molecule. The molecular dynamics require force field 
data. The computer graphics systems enable prediction of how a new compound will link to the 
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target molecule and allow experimental manipulation of the structures of the compound and 
target molecule to perfect binding specificity. Prediction of what the molecule-compound 
interaction will be when small changes are made in one or both requires molecular mechanics 
software and computationally intensive computers, usually coupled with user-friendly, menu- 
5 driven interfaces between the molecular design program and the user. 

1 38. Examples of molecular modeling systems are the CHARMm and QUANTA 
programs, Polygen Corporation, Waltham, MA. CHARMm performs the energy minimization 
and molecular dynamics functions. QUANTA performs the construction, graphic modeling and 
analysis of molecular structure. QUANTA allows interactive construction, modification, 

10 visualization, and analysis of the behavior of molecules with each other. 

139. A number of articles review computer modeling of drugs interactive with specific 
proteins, such as Rotivinen, et al., 1988 Acta Pharmaceutica Fennica 97, 159-166; Ripka, New 
Scientist 54-57 (June 16, 1988); McKinaly and Rossmann, 1989 Annu. Rev. Pharmacol 
Toxiciol. 29, 1 1 1-122; Perry and Da vies, OSAR: Quantitative Structure-Activity Relationships 

15 in Drug Design pp. 189-193 (Alan R. Liss, Inc. 1989); Lewis and Dean, 1989 Proc. R. Soc. 
Lond. 236, 125-140 and 141-162; and, with respect to a model enzyme for nucleic acid 
components, Askew, etal., 1989 7. Am. Chem. Soc. 111,1082-1090. Other computer 
programs that screen and graphically depict chemicals are available from companies such as 
BioDesign, Inc., Pasadena, CA., Allelix, Inc, Mississauga, Ontario, Canada, and Hypercube, Inc., 

20 Cambridge, Ontario. Although these are primarily designed for application to drugs specific to 
particular proteins, they can be adapted to design of molecules specifically interacting with 
specific regions of DNA or RNA, once that region is identified. 

140. Although described above with reference to design and generation of compounds 
which could alter binding, one could also screen libraries of known compounds, including 

25 natural products or synthetic chemicals, and biologically active materials, including proteins, for 
compounds which alter substrate binding or enzymatic activity. 

(5) Insects that can be targeted 

141. Arthropods include Crustacea, which are things like prawns, crabs and woodlice; 
Myriapoda, which are centipedes, millipedes and such; Chelicerata (Arachnida), which are 

30 spiders, scorpions and harvestmen etc., and Uniramia (Insecta), which are things like beetles, 
bees and flies. 

142. Insects are found in the phylum Arthorpoda, Subphylum Insecta (also often called 
a class), Class Hexapoda, and Subclasses Apterygota, Exopterygota, and Endopterygota. The 
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Apterygota includes the orders Protiira, Collembola (Springtails), Thysanura (Silverfish), Diplura 
(Two Pronged Bristle-tails). The Exopterygota includes the orders Ephemeroptera (Mayflies), 
Odonata (Dragon flies), Plecoptera (Stoneflies), Grylloblatodea, Orthoptera, Phasmida (Stick- 
Insects), Dermaptera (Earwigs), Embioptera (Web Spinners), Dictyoptera (Cockroaches and 
5 Mantids), Isoptera (Termites), Zoraptera, Psocoptera (Bark and Book Lice), Mallophaga (Biting 
Lice), Siphunculata (Sucking Lice ), Hemiptera (True Bugs) Thysanoptera, The Endopterygota 
includes the orders Neuropter (Lacewings), Coleoptera (Beetles), Strepsiptera (Stylops), 
Mecoptera (Scorpionflies), Siphonaptera (Fleas), Diptera (True Flies which are unusual in that 
they only have one pair of functional wings. The other pair is reduced to a pair of knoblike 
10 organs, called halteres, which play a part in stabilizing these insects during flight. True flies 
include house flies and bluebottles, mosquitoes, horseflies, midges, and antler-headed flies), 
Lepidoptera (Butterflies and Moths), Trichoptera (Caddis Flies), and Hymenoptera (Ants Bees 
and Wasps). 

(6) Exemplary pesticides that can be used in combination 

15 143. The disclosed compositions, such as DHR96 inhibitors can be combined with any 

pesticide or class of pesticides. For example, the DHR96 inhibitors can be combined with a 
pesticide that invokes the xenobiotic pathway. The DHR96 inhibitors can also be combined with 
any pesticide that effects the expression of a gene in the following four familes, cytochrome 
P450s, carboxylesterases, glutathione S-transferases, and UDP-glucuronosyltransferases When it 

20 is unknown which xenobiotic genes are affected by the pesticide, this can be determined by 
observing whether the pesticide turns on one or more genes that are in the xenobiotic pathway, 
by for example, microarray technology, or any other technology that determines gene expression, 
such as RT-PCR. In certain embodiements, when a particular gene product is specifically 
overexpressed in a resistant line of insects, that gene product can be considered a xenobiotic 

25 gene. Other examples, such as cuticle proteins and a serum carrier protein, were seen in the 
microarray experiments as well. In other embodiements any encoded protein that confers 
resistance to a toxic compound can be considered a xenobiotic compound. 

144. There are many different pesticides that are relatively common chemicals, such as 
arsenicals, petroleum oils, nicotine, pyrethrum, rotenone, sulfur, hydrogen cyanide gas, and 

30 cryolite. However, most pesticides are non-natural chemically synthesized compounds. For 
example, there are different classes and subclasses of pesticides, such as organochlorines, 
examples of which are diphenyl aliphatics, hexchlorocyclohexane (HCH) or 
benzenehexachloride (BHC), Cyclodienes, Polychloroterpenes, organophosphates (OPs) 
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examples of wheh are esters of phosphorus, organosulfers, carbamates, formamidines, 
dinitrophenols, oganotins, pyrethroids, nicotinoids (also known as nitro-quanidines, 
neonicotinyls, neonicotinoids, chloronicotines, or chloronicotinyls), spinosyns, fiproles (or 
Phenylpyrazoles), pyrroles, pyrazoles, pyridazinones, quinazolines, benzoylureas, botanicals, 
5 (natural insecticides), synergists or activators, antibiotics, ftimigants, insect repellants, and 
inorganics. 

145. Another way of classifying insecticides is by their mode of action, for example, 
sodium and/or potassium channel inhibitors, buerotoxins, GABA (gamma-aminobutyric acid) 
receptor modulators, such as inhibitors and activators, cholinesterase (ChE) inhibitors, 

10 aliesterase inhibitors, monoamine oxidase inhibitors, oxidative phosphorylation couplers or 
uncouplers, adenosine triphosphate (ATP) formation inhibitors, dinitrophenol uncoupling 
inhibitors, axionic poisons, inhibition of postsynaptic nicotinergic acetylcholine receptors, 
inhibiting of binding of acetylcholine in nicotinic acetylcholine receptors at the postsynaptic cell, 
inhibition of gamma-aminobutyric acid- (GABA) regulated chloride channels in neurons, 

1 5 inhibitors of mitochondrial electron transport at the NADH-CoQ reductase site, general 

inhibitors of mitochondrial electron transport at Site 1, insect growth regulators (IGR, inhibitors 
of various life cycles and stages in the insect), chitin synthesis inhibitors, inhibitors of 
exoskeleton development, respiratory enzyme inhibitors, inhibitors of the interaction between 
NAD+ and coenzyme Q, .inhibitors of molting, inhibitors of the biosynthesis or metabolism of 

20 ecdysone, synergists, such as inhibitors of cytochrome P-450 dependent polysubstrate 
monooxygenases (PSMOs), and narcotics, calcium channel inhibitors, and repellants. 

146. Examples of organochlorines are (chlorinated hydrocarbons, chlorinated organics, 
chlorinated insecticides, and chlorinated synthetics) Diphenyl Aliphatics, such as DDT, DDD, 
dicofol, ethylan, chlorobenzilate, and methoxychlor, Hexchlorocyclohexanes (HCH) or 

25 benzenehexachloride (BHC), which are typically gamma isomers, such as lindane, Cyclodienes, 
such as chlordane, aldrin and dieldrin, heptachlor, endrin, mirex, endosulfan, and chlordecone 
(Kepone®), and Polychloroterpenes, such as toxaphene and strobane. 

147. Examples of organophosphates (OPs) examples of which are esters of 
phosphorus, (also called organic phosphates, phosphorus insecticides, nerve gas relatives, and 

30 phosphoric acid esters) derived from phosphorus acids, such as sarin, soman, and tabun, 
subclasses included phosphates, phospho-nates, phosphorothioates, phosphorodithioates, 
phosphorothiolates and phosphoramidates. There are also aliphatic, phenyl, and heterocyclic 
derivatives. The aliphatics include TEPP, malathion, trichlorfon (Dylox®), monocrotophos 
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(Azodrin®), dimethoate (Cygon®), oxydemetonmethyl (Meta Systox®), dimethoate (Cygon®), 
dicrotophos (Bidrin®), disulfoton (Di-Syston®), dichlorvos (Vapona®), rnevinphos 
(Phosdrin®), methamidophos (Monitor®), and acephate (Orthene®). The Phenyl derivatives 
parathion (ethyl parathion), methyl parathion, profenofos (Curacron®), sulprofos (Bolstar®), 
isofenphos (Oftanol®, Pryfon®), fenitrothion (Sumithion®), fenthion (Dasanit®), famphur 
(Cyflee® and Warbex®). The Heterocyclic derivatives include diazinon, azinphos-methyl 
(Guthion®), azinphos-ethyl (Acifon®, Gusathion®), chlorpyrifos (Dursban®, Lorsban®, Lock- 
On®), methidathion (Supracide®), phosmet (Imidan®), isazophos (Brace®, Triumph®), and 
chlorpyrifos-methyl (Reldan®). 

148. Examples of organosulfers typically contain two phenyl rings, resembling DDT, 
with sulfur in place of carbon as the central atom, and include tetradifon (Tedion®), propargite 
(Omite®, Comite®), and ovex (Ovotran®). 

149. Examples of carbamates are derivatives of carbamic acid and include 
carbaryl (Sevin®), methomyl (Lannate®), carbofuran (Furadan®), aldicarb (Temik®), oxamyl 
(Vydate®), thiodicarb (Larvin®), methiocarb (Mesurol®), propoxur (Baygon®), bendiocarb 
(Ficam®), carbosulfan (Advantage®), aldoxycarb (Standak®), promecarb (Carbamult®), and 
fenoxycarb (Logic®, Torus®). 

150. Examples of formamidines include chlordimeform (Galecron®, Fundal®), 
formetanate (Carzol®), and amitraz (Mitac®, Ovasyn®. 

151. Examples of dinitrophenols include binapacryl (Morocide®) and dinocap 
(Karathane®). 

1 52. Examples of oganotins include cyhexatin (Plictran®) and Fenbutatin-oxide 
(Vendex®). 

153. Examples of pyrethroids natural pyrethrum and synthetic pyrethroids including 
allethrin (Pynamin®), tetramethrin (Neo-Pynamin®) (1965), resmethrin (Synthrin®), 
bioresrnethrin, Bioallethrin®, phonothrin (Sumithrin®), fenvalerate (Pydrin®, Tribute®, & 
Bellmark®), permethrin (Ambush®, Astro®, Dragnet®, Flee®, Pounce®, Prelude®, Talcord® 
& Torpedo®), bifenthrin (Capture®, Talstar®), /awWa-cyhalothrin (Demand®, Karate®, 
Scimitar® & Warrior®), cypermethrin (Ammo®, Barricade®,Cymbush®, Cynoff® & 
Ripcord®), cyfluthrin (Baythroid®, Countdown®, Cylense®, Laser® & Tempo®), deltamethrin 
(Decis®) esfenvalerate (Asana®, Hallmark®), fenpropathrin (Danitol®), flucythrinate (Cybolt®, 
Payoff®), fluvalinate (Mavrik®, Spur ®), prallethrin (Etoc®), /au-fluvalinate (Mavrik®) 
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tefluthrin (Evict®, Fireban®, Force® & Raze®), tralomethrin (Scout X-TRA®, Tralex®), and 
ze/tf-cypermethrin (Mustang® Fury®), acrinathrin (Rufast®), and imiprothrin (Pralle®. 

1 54. Examples of nicotinoids (also known as nitro-quanidines, neonicotinyls, 
neonicotinoids, chloronicotines, or chloronicotinyls) including Imidacloprid (Admire®, 

5 Confidor®,Gaucho®, Merit®, Premiei®, Premise® and Provado®), acetamiprid (Mospilan®), 
thiamethoxam (Actara®, Platinum®), and nitenpyram (Bestguard®). 

1 55. Examples of spinosyns include (Success®, Tracer Naturalyte®). 

1 56. Examples of fiproles (or Phenylpyrazoles) include Fipronil ((Regent®, Icon®, 
Frontline®). 

10 157. Examples of pyrroles include Chlorfenapyr ((Alert®, Pirate®. 

1 58. Examples of pyrazoles include tebufenpyrad (Pyranica®, Masai®) and 
fenpyroximate (Acaban®, Dynamite®). 

159. Examples of pyridazinones include Pyridaben ((Nexter®, Sanmite®). 

1 60. Examples of quinazolines fenazaquin ((Matador®). 

15 161 . Examples of benzoylureas include triflumuron (Alsystin®), chlorfluazuron 

(Atabron®, Helix®), followed by teflubenzuron (Nomolt®, Dart®), hexaflumuron (Trueno®, 
Consult®), flufenoxuron (Cascade®), flucycloxuron (Andalin®), flurazuron, novaluron, 
diafenthiuron, Lufenuron (Axor®), and diflubenzuron ((Dimilin®, Adept®, Micromite®). 

162. Examples of botanicals, (natural insecticides) include sulfur, tobacco, pyrethrum, 
20 derris, hellebore, quassia, camphor, and turpentine, and Pyrethrum, alkaloids, such as nicotine, 

caffeine (coffee, tea), quinine (cinchona bark), morphine (opium poppy), cocaine (coca leaves), 
ricinine (a poison in castor oil beans), strychnine (Strychnos nux vomica), coniine (spotted 
hemlock, the poison used by Socrates), and LSD (a hallucigen from the ergot fungus attacking 
grain), rotenone, Limonene or d-Limonene, neem, Azadirachtin (Azatin® is marketed as an 
25 insect growth regulator, and Align® and Nemix®). 

1 63. Examples of synergists or activators are not insecticides per se, but rather enhance 
the activity of insecticides having a primary insecticidal effect. Examples include, piperonyl 
butoxide, and contain the methylenedioxyphenyl moiety (found in sesame seed oil (sesamin)). 

164. Examples of antibiotics include avermectins, Abamectin, Clinch®, Emamectin 
30 benzoate (Proclaim®, Denim®). 

165. Examples of fumigants typically contain one or more halogens, such as methyl 
bromide (Aspelin and Grube 1998), ethylene dichloride, hydrogen cyanide, sulfuryl fluoride 
(Vikane®), Vapam®, Telone® II, D-D®, chlorothene, ethylene oxide, napthalene crystals, 
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paradichlorobenzene crystals, Phosphine gas (PH 3 ) produced by alunimum or magnesium 
phosphide pellets. 

166. Examples of insect repellants include dimethyl phthalate, Indalone®, Rutgers 
612®, dibutyl phthalate, various MGK® repellents, benzyl benzoate, the military clothing 

5 repellent (N-butyl acetanilide), dimethyl carbate (Dimelone®) and diethyl toluamide (DEET, 
Delphene®). 

167. Examples of inorganics include sulfur, mercury, boron, thallium, arsenic, 
antimony, selenium, and fluoride, arsenicals, including copper arsenate, Paris green, lead 
arsenate, and calcium arsenate, inorganic fluorides such as sodium fluoride, barium fluosilicate, 

10 sodium silicofluoride, and cryolite (Kryocide®), Boric acid, Sodium borate (disodium octaborate 
tetrahydrate) (Tim-Bor®, Bora-Care®), silica gels or silica aerogels, such as Dri-Die®, 
Drianone®, and Silikil Microcel®. 

1 68. Other compounds not easily categorized include cyromazine (Larvadex®, 
Trigard®), a triazine, pyriproxyfen (Knack®, Esteem®, Archer®), insect growth inhibitors such 

15 as buprofezin (Applaud®) and thiadiazines, tetrazines, such as clofentezine (Apollo®, 
Acaristop®), Enzone®, sodium tetrathiocarbonate, and Clandosan®. 

169. Also used are Veratrum Alkaloids, such as sabadilla, veratridine, and cevadine. 

170. Also used are ryanoids, such as ryanodine, 1 0-(O-methyl)-ryanodine, 9,21- 
dehydroryanodine, ryanodol, and 9,21-dehydroryanodine. 

20 171. Also used are octopamines mimics, such as amitraz® and chlordimeform. 

172. Also included are respiration inhibitors, such as fenazaquin, pyridaben, 
amidinohydrazone, hydramethylnon and the perfluorooctanesulfonamide, and sulfluramid. 

173. Also included are juvenile hormone mimics, such a juvenile hormone HI, 
methoprene, and fenoxycarb. 

25 174. Also included are toxins produced by Bacillus thuringiensis, such as Dipel®, 

Javelin®, Agree®. 

C. Compositions 

175. Disclosed are the components to be used to prepare the disclosed compositions as 
well as the compositions themselves to be used within the methods disclosed herein. These and 
30 other materials are disclosed herein, and it is understood that when combinations, subsets, 

interactions, groups, etc. of these materials are disclosed that while specific reference of each 
various individual and collective combinations and permutation of these compounds may not be 
explicitly disclosed, each is specifically contemplated and described herein. For example, if a 
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particular DHR96 or variants or fragments thereof is disclosed and discussed and a number of 
modifications that can be made to a number of molecules including the DHR96 or variants or 
fragments thereof are discussed, specifically contemplated is each and every combination and 
permutation of DHR96 or variants or fragments thereof and the modifications that are possible 
5 unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are 

disclosed as well as a class of molecules D, E, and F and an example of a combination molecule, 
A-D is disclosed, then even if each is not individually recited each is individually and 
collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, and OF 
are considered disclosed. Likewise, any subset or combination of these is also disclosed. Thus, 

10 for example, the sub-group of A-E, B-F, and C-E would be considered disclosed. This concept 
applies to all aspects of this application including, but not limited to, steps in methods of making 
and using the disclosed compositions. Thus, if there are a variety of additional steps that can be 
performed it is understood that each of these additional steps can be performed with any specific 
embodiment or combination of embodiments of the disclosed methods. 

15 1. Sequence similarities 

1 76. It is understood that as discussed herein the use of the terms homology and 
identity mean the same thing as similarity. Thus, for example, if the use of the word homology 
is used between two non-natural sequences it is understood that this is not necessarily indicating 
an evolutionary relationship between these two sequences, but rather is looking at the similarity 

20 or relatedness between their nucleic acid sequences. Many of the methods for determining 

homology between two evolutionarily related molecules are routinely applied to any two or more 
nucleic acids or proteins for the purpose of measuring sequence similarity regardless of whether 
they are evolutionarily related or not. 

177. In general, it is understood that one way to define any known variants and 
25 derivatives or those that might arise, of the disclosed genes and proteins herein, is through 

defining the variants and derivatives in terms of homology to specific known sequences. This 
identity of particular sequences disclosed herein is also discussed elsewhere herein. In general, 
variants of genes and proteins herein disclosed typically have at least, about 70, 71, 72, 73, 74, 
75, 76, 77, 78, 79, 80, 81 , 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 
30 percent homology to the stated sequence or the native sequence. Those of skill in the art readily 
understand how to determine the homology of two proteins or nucleic acids, such as genes. For 
example, the homology can be calculated after aligning the two sequences so that the homology 
is at its highest level. 
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1 78. Another way of calculating homology can be performed by published algorithms. 
Optimal alignment of sequences for comparison may be conducted by the local homology 
algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for 

5 similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by 
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in 
the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
Madison, WI), or by inspection. 

179. The same types of homology can be obtained for nucleic acids by for example the 
10 algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. 

Sci. USA 86:7706-7710, 1989, Jaeger etal. Methods Enzymol. 183:281-306, 1989 which are 
herein incorporated by reference for at least material related to nucleic acid alignment. It is 
understood that any of the methods typically can be used and that in certain instances the results 
of these various methods may differ, but the skilled artisan understands if identity is found with 
1 5 at least one of these methods, the sequences would be said to have the stated identity, and be 
disclosed herein. 

1 80. For example, as used herein, a sequence recited as having a particular percent 
homology to another sequence refers to sequences that have the recited homology as calculated 
by any one or more of the calculation methods described above. For example, a first sequence 

20 has 80 percent homology, as defined herein, to a second sequence if the first sequence is 
calculated to have 80 percent homology to the second sequence using the Zuker calculation 
method even if the first sequence does not have 80 percent homology to the second sequence as 
calculated by any of the other calculation methods. As another example, a first sequence has 80 
percent homology, as defined herein, to a second sequence if the first sequence is calculated to 

25 have 80 percent homology to the second sequence using both the Zuker calculation method and 
the Pearson and Lipman calculation method even if the first sequence does not have 80 percent 
homology to the second sequence as calculated by the Smith and Waterman calculation method, 
the Needleman and Wunsch calculation method, the Jaeger calculation methods, or any of the 
other calculation methods. As yet another example, a first sequence has 80 percent homology, as 

30 defined herein, to a second sequence if the first sequence is calculated to have 80 percent 

homology to the second sequence using each of calculation methods (although, in practice, the 
different calculation methods will often result in different calculated homology percentages). 
2. Hybridization/selective hybridization 
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181. The term hybridization typically means a sequence driven interaction between at 
least two nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven 
interaction means an interaction that occurs between two nucleotides or nucleotide analogs or 
nucleotide derivatives in a nucleotide specific manner. For example, G interacting with C or A 
5 interacting with T are sequence driven interactions. Typically sequence driven interactions occur 
on the Watson-Crick face or Hoogsteen face of the nucleotide. The hybridization of two nucleic 
acids is affected by a number of conditions and parameters known to those of skill in the art. For 
example, the salt concentrations, pH, and temperature of the reaction all affect whether two 
nucleic acid molecules will hybridize. 

10 1 82. Parameters for selective hybridization between two nucleic acid molecules are 

well known to those of skill in the art. For example, in some embodiments selective 
hybridization conditions can be defined as stringent hybridization conditions. For example, 
stringency of hybridization is controlled by both temperature and salt concentration of either or 
both of the hybridization and washing steps. For example, the conditions of hybridization to 

15 achieve selective hybridization may involve hybridization in high ionic strength solution (6X 
SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the melting temperature 
at which half of the molecules dissociate from their hybridization partners) followed by washing 
at a combination of temperature and salt concentration chosen so that the washing temperature is 
about 5°C to 20°C below the Tm. The temperature and salt conditions are readily determined 

20 empirically in preliminary experiments in which samples of reference DNA immobilized on 
filters are hybridized to a labeled nucleic acid of interest and then washed under conditions of 
different stringencies. Hybridization temperatures are typically higher for DNA-RNA and RNA- 
RNA hybridizations. The conditions can be used as described above to achieve stringency, or as 
is known in the art. (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold 

25 Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989; Kunkel et al. Methods 

Enzymol. 1 987: 1 54:367, 1 987 which is herein incorporated by reference for material at least 
related to hybridization of nucleic acids). A preferable stringent hybridization condition for a 
DNA:DNA hybridization can be at about 68°C (in aqueous solution) in 6X SSC or 6X SSPE 
followed by washing at 68°C. Stringency of hybridization and washing, if desired, can be 

30 reduced accordingly as the degree of complementarity desired is decreased, and further, 
depending upon the G-C or A-T richness of any area wherein variability is searched for. 
Likewise, stringency of hybridization and washing, if desired, can be increased accordingly as 
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homology desired is increased, and further, depending upon the G-C or A-T richness of any area 
wherein high homology is desired, all as known in the art. 

1 83. Another way to define selective hybridization is by looking at the amount 
(percentage) of one of the nucleic acids bound to the other nucleic acid. For example, in some 

5 embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 71 , 72, 
73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 
99, 100 percent of the limiting nucleic acid is bound to the non-limiting nucleic acid. Typically, 
the non-limiting primer is in for example, 10 or 100 or 1000 fold excess. This type of assay can 
be performed at under conditions where both the limiting and non-limiting primer are for 

10 example, 10 fold or 100 fold or 1000 fold below their kd, or where only one of the nucleic acid 
molecules is 10 fold or 100 fold or 1000 fold or where one or both nucleic acid molecules are 
above their k^. 

1 84. Another way to define selective hybridization is by looking at the percentage of 
primer that gets enzymatically manipulated under conditions where hybridization is required to 

15 promote the desired enzymatic manipulation. For example, in some embodiments selective 

hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the 
primer is enzymatically manipulated under conditions which promote the enzymatic 
manipulation, for example if the enzymatic manipulation is DNA extension, then selective 

20 hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the 
primer molecules are extended. Preferred conditions also include those suggested by the 
manufacturer or indicated in the art as being appropriate for the enzyme performing the 
manipulation. 

25 1 85. Just as with homology, it is understood that there are a variety of methods herein 

disclosed for determining the level of hybridization between two nucleic acid molecules. It is 
understood that these methods and conditions may provide different percentages of hybridization 
between two nucleic acid molecules, but unless otherwise indicated meeting the parameters of 
any of the methods would be sufficient. For example if 80% hybridization was required and as 

30 long as hybridization occurs within the required parameters in any one of these methods it is 
considered disclosed herein. 
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1 86. It is understood that those of skill in the art understand that if a composition or 
method meets any one of these criteria for determining hybridization either collectively or singly 
it is a composition or method that is disclosed herein. 
3. Nucleic acids 

5 187. There are a variety of molecules disclosed herein that are nucleic acid based, 

including for example the nucleic acids that encode, for example DHR96 or variants or 
fragments thereof, as well as various functional nucleic acids. The disclosed nucleic acids are 
made up of for example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting 
examples of these and other molecules are discussed herein. It is understood that for example, 

1 0 when a vector is expressed in a cell, that the expressed mRNA will typically be made up of A, C, 
G, and U. Likewise, it is understood that if, for example, an antisense molecule is introduced 
into a cell or cell environment through for example exogenous delivery, it is advantagous that the 
antisense molecule be made up of nucleotide analogs that reduce the degradation of the antisense 
molecule in the cellular environment. 

15 a) Nucleotides and related molecules 

1 88. A nucleotide is a molecule that contains a base moiety, a sugar moiety and a 
phosphate moiety. Nucleotides can be linked together through their phosphate moieties and 
sugar moieties creating an internucleoside linkage. The base moiety of a nucleotide can be 
adenin-9-yl (A), cytosin-l-yl (C), guanin-9-yl (G), uracil-l-yl (U), and thymin-l-yl (T). The 

20 sugar moiety of a nucleotide is a ribose or a deoxyribose. The phosphate moiety of a nucleotide 
is pentavalent phosphate. An non-limiting example of a nucleotide would be 3 , -AMP (3'- 
adenosine monophosphate) or 5-GMP (5-guanosine monophosphate). 

1 89. A nucleotide analog is a nucleotide which contains some type of modification to 
either the base, sugar, or phosphate moieties. Modifications to the base moiety would include 

25 natural and synthetic modifications of A, C, G, and T/U as well as different purine or pyrimidine 
bases, such as uracil-5-yl (.psi.), hypoxanthin-9-yl (I), and 2-aminoadenin-9-yl. A modified base 
includes but is not limited to 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, 
hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 
2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 

30 190. 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo 

uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 
8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 
5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 
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7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 
3-deazaguanine and 3-deazaadenine. Additional base modifications can be found for example in 
U.S. Pat. No. 3,687,808, Englisch etal., Angewandte Chemie, International Edition, 1991, 30, 
613, and Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, 
5 Crooke, S. T. and Lebleu, B. ed., CRC Press, 1993. Certain nucleotide analogs, such as 

5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and 0-6 substituted purines, including 
2-aminopropyIadenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine can increase 
the stability of duplex formation. Often time base modifications can be combined with for 
example a sugar modification, such as 2-O-methoxyethyl, to achieve unique properties such as 

10 increased duplex stability. There are numerous United States patents such as 4,845,205; 
5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 
5,502,177; 5,525,71 1 ; 5,552,540; 5,587,469; 5,594,121, 5,596,091 ; 5,614,617; and 5,681,941, 
which detail and describe a range of base modifications. Each of these patents is herein 
incorporated by reference. 

15 191. Nucleotide analogs can also include modifications of the sugar moiety. 

Modifications to the sugar moiety would include natural modifications of the ribose and deoxy 
ribose as well as synthetic modifications. Sugar modifications include but are not limited to the 
following modifications at the 2' position: OH; F; 0-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- 
or N-alkynyl; or O-alkyl-O-alkyl, wherein the alky], alkenyl and alkynyl may be substituted or 

20 unsubstituted Ci to Cio, alkyl or C2 to C10 alkenyl and alkynyl. 2' sugar modiifcations also 

include but are not limited to -0[(CH 2 )„ 0] m CH 3 , -0(CH 2 ) n OCH 3 , -0(CH 2 ) n NH 2 , -0(CH 2 )„ 
CH 3 , -0(CH 2 ) n -ONH 2 , and -0(CH 2 )„ON[(CH 2 )„ CH 3 )] 2 , where n and m are from 1 to about 10. 

192. Other modifications at the 2' position include but are not limted to: Ci to C10 
lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH 3 , OCN, CI, 

25 Br, CN, CF 3 , OCF 3 , SOCH 3 , SO z CH 3 , ON0 2 , N0 2 , N 3 , NH 2 , heterocycloalkyl, 

heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, 
a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an 
oligonucleotide, or a group for improving the pharmacodynamic properties of an 
oligonucleotide, and other substituents having similar properties. Similar modifications may 

30 also be made at other positions on the sugar, particularly the 3' position of the sugar on the 3 f 
terminal nucleotide or in 2'-5' linked oligonucleotides and the 5' position of 5* terminal 
nucleotide. Modified sugars would also include those that contain modifications at the bridging 
ring oxygen, such as CH 2 and S. Nucleotide sugar analogs may also have sugar mimetics such as 
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cyclobutyl moieties in place of the pentofiiranosyl sugar. There are numerous United States 
patents that teach the preparation of such modified sugar structures such as 4,981,957; 
5,1 18,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 
5,567,81 1; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 
5 5,658,873; 5,670,633; and 5,700,920, each of which is herein incorporated by reference in its 
entirety. 

193. Nucleotide analogs can also be modified at the phosphate moiety. Modified 
phosphate moieties include but are not limited to those that can be modified so that the linkage 
between two nucleotides contains a phosphorothioate, chiral phosphorothioate, 

10 phosphorodithioate, phosphotriester, aminoalkylphosphotriester, methyl and other alkyl 
phosphonates including 3-alkylene phosphonate and chiral phosphonates, phosphinates, 
phosphoramidates including 3-amino phosphoramidate and aminoalkylphosphoramidates, 
thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and 
boranophosphates. It is understood that these phosphate or modified phosphate linkage between 

15 two nucleotides can be through a 3'-5' linkage or a 2-5' linkage, and the linkage can contain 
inverted polarity such as 3-5' to 5-3' or 2'-5' to 5'-2\ Various salts; mixed salts and free acid 
forms are also included. Numerous United States patents teach how to make and use nucleotides 
containing modified phosphates and include but are not limited to, 3,687,808; 4,469,863; 
4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 

20 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 
5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; and 5,625,050, each of 
which is herein incorporated by reference. 

1 94. It is understood that nucleotide analogs need only contain a single modification, 
but may also contain multiple modifications within one of the moieties or between different 

25 moieties. 

1 95. Nucleotide substitutes are molecules having similar functional properties to 
nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). 
Nucleotide substitutes are molecules that will recognize nucleic acids in a Watson-Crick or 
Hoogsteen manner, but which are linked together through a moiety other than a phosphate 

30 moiety. Nucleotide substitutes are able to conform to a double helix type structure when 
interacting with the appropriate target nucleic acid. 

1 96. Nucleotide substitutes are nucleotides or nucleotide analogs that have had the 
phosphate moiety and/or sugar moieties replaced. Nucleotide substitutes do not contain a 
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standard phosphorus atom. Substitutes for the phosphate can be for example, short chain alky] 
or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl intemucleoside 
linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. 
These include those having morpholino linkages (formed in part from the sugar portion of a 
5 nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones;formacetyl and 
thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene 
containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino 
backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, 
O, S and CH2 component parts. Numerous United States patents disclose how to make and use 

10 these types of phosphate replacements and include but are not limited to 5,034,506; 5,166,3 1 5; 
5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 
5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 
5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 
5,677,439, each of which is herein incorporated by reference. 

15 1 97. It is also understood in a nucleotide substitute that both the sugar and the 

phosphate moieties of the nucleotide can be replaced, by for example an amide type linkage 
(aminoethylglycine) (PNA). United States patents 5,539,082; 5,7 14,33 l;and 5,719,262 teach 
how to make and use PNA molecules, each of which is herein incorporated by reference. (See 
also Nielsen et ah, Science, 1991, 254, 1497-1500). 

20 1 98. It is also possible to link other types of molecules (conjugates) to nucleotides or 

nucleotide analogs to enhance for example, cellular uptake. Conjugates can be chemically linked 
to the nucleotide or nucleotide analogs. Such conjugates include but are not limited to lipid 
moieties such as a cholesterol moiety (Letsingeret al., Proc. Natl. Acad. Sci. USA, 1989, 

199. 86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 

25 4, 1053-1060), a thioether, e.g., hexyl-S-tritylthiol (Manoharan etal., Ann. N.Y. Acad. Sci., 
1992, 660, 306-309; Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a 
thiocholesterol (Oberhauser et al., Nucl. Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., 
dodecandiol or undecyl residues (Saison-Behmoaras et al., EMBO J., 1991, 10, 1 1 1 1-1 118; 
Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., Biochimie, 1993, 75, 49-54), 

30 a phospholipid, e.g., di-hexadecyl-rac-glycerol or triethylammonium 

l,2-di-0-hexadecyl-rac-glycero-3-H-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36, 
3651-3654; Shea et al., Nucl. Acids Res., 1990, 18, 3777-3783), a polyamine or a polyethylene 
glycol chain (Manoharan et al., Nucleosides & Nucleotides, 1995, 14, 969-973), or adamantane 
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acetic acid (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety 

(Mishra et al., Biochim. Biophys. Acta, 1995, 1264, 229-237), or an octadecylamine or 

hexylamino-carbonyl-oxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 

277, 923-937. Numerous United States patents teach the preparation of such conjugates and 
5 include, but are not limited to U.S. Pat. Nos. 4,828,979; 4,948,882; 5,218,105; 5,525,465; 

5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731; 5,580,731; 5,591,584; 5,109,124; 

5,1 18,802; 5,138,045; 5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 

4,605,735; 4,667,025; 4,762,779; 4,789,737; 4,824,941; 4,835,263; 4,876,335; 4,904,582; 

4,958,013; 5,082,830; 5,112,963; 5,214,136; 5,082,830; 5,112,963; 5,214,136; 5,245,022; 
10 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 5,371,241, 5,391,723; 

5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 5,565,552; 5,567,810; 5,574,142; 

5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 5,688,941, each of which 

is herein incorporated by reference. 

200. A Watson-Crick interaction is at least one interaction with the Watson-Crick face 
15 of a nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of a 

nucleotide, nucleotide analog, or nucleotide substitute includes the C2, Nl, and C6 positions of a 
purine based nucleotide, nucleotide analog, or nucleotide substitute and the C2, N3, C4 positions 
of a pyrimidine based nucleotide, nucleotide analog, or nucleotide substitute. 

201 . A Hoogsteen interaction is the interaction that takes place on the Hoogsteen face 
20 of a nucleotide or nucleotide analog, which is exposed in the major groove of duplex DNA. The 

Hoogsteen face includes the N7 position and reactive groups (NH2 or O) at the C6 position of 
purine nucleotides. 

b) Sequences 

202. There are a variety of sequences related to the DHR96 gene, and these sequences 
25 and others are herein incorporated by reference in their entireties as well as for individual 

subsequences contained therein. 

203. One particular sequence set forth in SEQ ID NO:7 and having Genbank accession 
number NM 079769 is used herein, as an example, to exemplify the disclosed compositions and 
methods. It is understood that the description related to this sequence is applicable to any 

30 sequence related to DHR96 or any other sequences disclosed herein, unless specifically indicated 
otherwise. Those of skill in the art understand how to resolve sequence discrepancies and 
differences and to adjust the compositions and methods relating to a particular sequence to other 
related sequences (i.e. sequences of DHR96 or variants or fragments thereof). Primers and/or 
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probes can be designed for any DHR96 sequence given the information disclosed herein and 
known in the art. 

c) Primers and probes 

204. Disclosed are compositions including primers and probes, which are capable of 
5 interacting with the genes disclosed herein. In certain embodiments the primers are used to 

support DNA amplification reactions. Typically the primers will be capable of being extended in 
a sequence specific manner. Extension of a primer in a sequence specific manner includes any 
methods wherein the sequence and/or composition of the nucleic acid molecule to which the 
primer is hybridized or otherwise associated directs or influences the composition or sequence of 

10 the product produced by the extension of the primer. Extension of the primer in a sequence 
specific manner therefore includes, but is not limited to, PCR, DNA sequencing, DNA 
extension, DNA polymerization, RNA transcription, or reverse transcription. Techniques and 
conditions that amplify the primer in a sequence specific manner are preferred. In certain 
embodiments the primers are used for the DNA amplification reactions, such as PCR or direct 

15 sequencing. It is understood that in certain embodiments the primers can also be extended using 
non-enzymatic techniques, where for example, the nucleotides or oligonucleotides used to 
extend the primer are modified such that they will chemically react to extend the primer in a 
sequence specific manner. Typically the disclosed primers hybridize with the nucleic acid or 
region of the nucleic acid or they hybridize with the complement of the nucleic acid or 

20 complement of a region of the nucleic acid. 

4. Delivery of the compositions to cells 

205. There are a number of compositions and methods which can be used to deliver 
nucleic acids to cells, either in vitro or in vivo. These methods and compositions can largely be 
broken down into two classes: viral based delivery systems and non-viral based delivery systems. 

25 For example, the nucleic acids can be delivered through a number of direct delivery systems 
such as, electroporation, lipofection, calcium phosphate precipitation, plasmids, viral vectors, 
viral nucleic acids, phage nucleic acids, phages, cosmids, or via transfer of genetic material in 
cells or carriers such as cationic liposomes. Appropriate means for transfection, including viral 
vectors, chemical transfectants, or physico-mechanical methods such as electroporation and 

30 direct diffusion of DNA, are described by, for example, Wolff, J. A., et al., Science, 247, 1465- 
1468, (1990); and Wolff, J. A. Nature, 352, 815-818, (1991)Such methods are well known in 
the art and readily adaptable for use with the compositions and methods described herein. In 
certain cases, the methods will be modifed to specifically function with large DNA molecules. 
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Further, these methods can be used to target certain diseases and cell populations by using the 
targeting characteristics of the carrier. 

a) Nucleic acid based delivery systems 

206. The term "transgene" is used herein to describe genetic material which is 

5 artificially inserted into the genome of an invertebrate cell. The transgene encodes a product that, 
when expressed in embryos, gives rise to a specific phenotype. A transgene can encode a 
transcription factor or mimetic thereof having the desired result. A recombinant DNA molecule 
or vector containing a heterologous protein gene expression unit can be used to transfect 
invertebrate cells (United States Patents 4,670,388 and 5,550,043, herein incorporated by 

10 reference in their entirety.) A gene expression unit can contain a DNA coding sequence for a 

selected protein or for a derivative thereof. Such derivatives can be obtained by manipulation of 
the gene sequence using traditional genetic engineering techniques, e.g., mutagenesis, restriction 
endonuclease treatment, ligation of other gene sequences including synthetic sequences and the 
like (T. Maniatis et al, Molecular Cloning, A Laboratory Manual., Cold Spring Harbor 

1 5 Laboratory, Cold Spring Harbor, N. Y. ( 1 982). 

207. Expression of the transgene can be targeted to occur in a non-adult stage of the 
animal, the transgene can be stably integrated into the genome of the animal in a manner such 
that its expression is controlled both spatially and temporally to the desired cell type and the 
correct developmental stage, i.e. to expression in embryonic neuroblasts. Specifically, the subject 

20 transgene can stably integrated into the genome of the animal under the control of a promoter 

that provides for expression. The transgene may be under the control of any convenient promoter 
that provides for this requisite spatial and temporal expression pattern, where the promoter can 
be endogenous or exogenous. A suitable promoter is the promoter located in the Drosophila 
melanogaster genome at position 86E1-3. 

25 208. Another suitable promoter of the Drosophila origin includes the Drosophila 

metallothionein promoter (Lastowski-Perry et al, J. Biol. Chem., 260:1527, 1985). This inducible 
promoter directs high-level transcription of the gene in the presence of metals, e.g., CuS04. Use 
of the Drosophila metallothionein promoter results in the expression system of the invention 
retaining full regulation even at very high copy number. This is in direct contrast to the use of the 

30 mammalian metallothionein promoter in mammalian cells in which the regulatory effect of the 
metal is diminished as copy number increases. In the Drosophila expression system, this retained 
inducibility effect increases expression of the gene product in the Drosophila cell at high copy 
number. 
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209. The Drosophila actin 5C gene promoter (B. J. Bond et al, Mol. CelL Biol., 6: 
2080, 1986) is also a desirable promoter sequence. The actin 5C promoter is a constitutive 
promoter and does not require addition of metal. Therefore, it is better-suited for use in a large 
scale production system, like a perfusion system, than is the Drosophila metallothionein 

5 promoter. An additional advantage is that the absence of a high concentration of copper in the 
media maintains the cells in a healthier state for longer periods of time. 

210. Examples of other known Drosophila promoters include, e.g., the inducible 
heatshock (Hsp70) and COPIA LTR promoters. The SV40 early promoter gives lower levels of 
expression than the Drosophila metallothionein promoter. 

10 211. The transgene may be integrated into the fly genome in a manner that provides for 

direct or indirect expression activation by the promoter, i.e. in a manner that provides for either 
cis or trans activation of gene expression by the promoter. In other words, expression' of the 
transgene may be mediated directly by the promoter, or through one or more transactivating 
agents. Where the transgene is under direct control of the promoter, i.e. the promoter regulates 

1 5 expression of the transgene in a cis fashion, the transgene is stably integrated into the genome of 
the fly at a site sufficiently proximal to the promoter and in frame with the promoter such that cis 
regulation by the promoter occurs. 

212. In other embodiments where expression of the transgene is indirectly mediated by 
the endogenous promoter, the promoter controls expression of the transgene through one or more 

20 transactivating agents, usually one transactivating agent, i.e. an agent whose expression is 

directly controlled by the promoter and which binds to the region of the transgene in a manner 
sufficient to turn on expression of the transgene. Any convenient transactivator may be 
employed. The GAL4 transactivator system an example of such a system. 

213. The G AL4 encoding sequence can be stably integrated into the genome of the 
25 animal in a manner such that it is operatively linked to the endogenous promoter that provides 

expression in the appropriate location. The GAL4 system consists of the yeast transcriptional 
activator G AL4 and its target the upstream activating sequence (UAS) located within the P- 
element. Initially, GAL4 and UAS are in separate lines. The UAS is mobilized to generate new 
UAS insertion lines which remain silent until a source of GAL4 is made available. Under the 
30 control of a promoter, the expression of GAL4 is directed in a particular pattern. Specialized 
promoters can be used to drive expression of GAL4 in tissue and cell specific manners. The 
GAL4 containing line is then crossed to the UAS containing line. The UAS in the presence of 
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G AL4 directs the expression of any genes adjacent to its insertion site. When the insertion site is 
located upstream from the coding region over-or ectopic expression occurs. 

214. Flies of line 31-1 (also referred to as 1822), as disclosed in Brand & Perrimon, 
Development (1 993) 1 1 8: 401 -41 5 express GAL4 in this manner, and are known to those of skill 

5 in the art. The transgene is stably integrated into a different location of the genome, generally a 
random location in the genome, where the transgene is operatively linked to an upstream 
activator sequence, i.e. UAS sequence, to which GAM binds and turns on expression of the 
transgene. Transgenic flies having a UAS: GAL4 transactivation system are known to those of 
skill in the art and are described in Brand & Perrimon, Development (1993) 1 1 8: 401 -41 5; and 
10 Phelps & Brand, Methods (April 1998) 14:367-379. 

215. A desirable gene expression unit or expression vector for the protein of interest 
cal also be constructed by fusing the protein coding sequence to a desirable signal sequence. The 
signal sequence functions to direct secretion of the protein from the host cell. Such a signal 
sequence may be derived from the sequence of tissue plasminogen activator (tPA). Other 

15 available signal sequences include, e.g., those derived from Herpes Simplex virus gene HSV-I 
gD (Lasky et al, Science, 233:209-212 1986). 

216. The DNA coding sequence can also be followed by a polyadenylation (poly A) 
region, such as an SV40 early poly A region. The poly A region which functions in the 
polyadenylation of RNA transcripts appears to play a role in stabilizing transcription. A similar 

20 poly A region can be derived from a variety of genes in which it is naturally present. This region 
can also be modified to alter its sequence provided that polyadenylation and transcript 
stabilization functions are not significantly adversely affected. 

217. The recombinant DNA molecule may also carry a genetic selection marker, as 
well as the protein gene functions. The selection marker can be any gene or genes which cause a 

25 readily detectable phenotypic change in a transfected host cell. Such phenotypic change can be, 
for example, drug resistance, such as the gene for hygromycin B resistance (i.e., hygromycin B 
phosphotransferase). 

218. Alternatively, a selection system using the drug methotrexate, and prokaryotic 
dihydrofolate reductase (DHFR) gene, can be used with Invertebrate cells. The endogenous 

30 eukaryotic DHFR of the cells is inhibited by methotrexate. Therefore, by transfecting the cells 
with a plasmid containing the prokaryotic DHFR which is insensitive to methotrexate and 
selecting with methotrexate, only cells transfected with and expressing the prokaryotic DHFR 
will survive. Unlike methotrexate, selection of transformed mammalian and bacterial cells, in the 
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Drosophila system, methotrexate can be used to initially high-copy number transfectants. Only 
cells which have incorporated the protective prokaryotic DHFR gene will survive. 
Concomitantly, these cells have the gene expression unit of interest. 

219. The subject transgenic flies can be prepared using any convenient protocol that 
5 provides for stable integration of the transgene into the fly genome in a manner sufficient to 

provide for the requisite spatial and temporal expression of the transgene, i.e. in embryonic 
neuroblasts. A number of different strategies can be employed to obtain the integration of the 
transgene with the requisite expression pattern. Generally, methods of producing the subject 
transgenic flies involve stable integration of the transgene into the fly genome. Stable integration 

10 is achieved by first introducing the transgene into a cell or cells of the fly, e.g. a fly embryo. The 
transgene is generally present on a suitable vector, such as a plasmid. Transgene introduction 
may be accomplished using any convenient protocol, where suitable protocols include: 
electroporation, microinjection, vesicle delivery, e.g. liposome delivery vehicles, and the like. 
Following introduction of the transgene into the cell(s), the transgene is stably integrated into the 

1 5 genome of the cell. Stable integration may be either site specific or random, but is generally 
random. 

220. Where integration is random, the transgene is typically integrated with the use of 
transposase. In such embodiments, the transgene can be introduced into the cell(s) within a 
vector that includes the requisite P element, terminal 31 base pair inverted repeats. Where the 

20 cell into which the transgene is to be integrated does not comprise an endogenous transposase, a 
vector encoding a transposase can also be introduced into the cell, e.g. a helper plasmid 
comprising a transposase gene, such as pTURBO (Steller & Pirrotta, Mol. Cell. Biol. 6:1640- 
1649, 1986). Methods of random integration of transgenes into the genome of a target 
Drosophila melanogaster cell(s) are disclosed in U.S. Pat. No. 4,670,388, the disclosure of which 

25 is herein incorporated by reference. 

221 . Transcription and expression of the heterologous protein coding sequences can be 
monitored. For example, Southern blot analysis can be used to determine copy number of the 
gpl20 gene. Northern blot analysis provides information regarding the size of the transcribed 
gene sequence. The level of transcription can also be quantitated. Expression of the selected 

30 protein in the recombinant cells can be further verified through Western blot analysis, for 
example. 

222. In those embodiments in which the transgene is stably integrated in a random 
fashion into the fly genome, means are also provided for selectively expressing the transgene at 
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the appropriate time during development of the fly. In other words, means are provided for 
obtaining targeted expression of the transgene. To obtain the desired targeted expression of the 
randomly integrated transgene, integration of particular promoter upstream of the transgene, as a 
single unit in the P element vector may be employed. Alternatively, a transactivator that mediates 
5 expression of the transgene may be employed. Of particular interest is the GAL4 system 
described in Brand & Perrimon, Development (1993) 118: 401-415; and Phelps & Brand, 
Methods (April 1998) 14:367-379. 

223. In one embodiment, the subject transgenic flies are produced by: (1) generating 
two separate lines of transgenic flies: (a) a first line that expresses GAL4; and (b) a second line 

10 in which the transgene is stably integrated into the cell genome and is fused to a UAS domain; 
(2) crossing the two lines; and (3) screening the progeny for the desired phenotype, i.e. adult 
onset neurodegeneration. Each of the above steps are well known to those of skill in the art 
(Brand & Perrimon, Development 1 18: 401-415, 1993; and Phelps & Brand, Methods 14:367- 
379, April 1998.) 

15 b) Non-nucleic acid based systems 

224. The disclosed compositions can be delivered to the target cells in a variety of 
ways. For example, the compositions can be delivered through electroporation, or through 
lipofection, or through calcium phosphate precipitation. The delivery mechanism chosen will 
depend in part on the type of cell targeted and whether the delivery is occurring for example in 

20 vivo or in vitro. 

225. Thus, the compositions can comprise, in addition to the disclosed compositions or 
vectors for example, lipids such as liposomes, such as cationic liposomes (e.g., DOTMA, DOPE, 
DC-cholesterol) or anionic liposomes. Liposomes can further comprise proteins to facilitate 
targeting a particular cell, if desired. Administration of a composition comprising a compound 

25 and a cationic liposome can be administered to the blood afferent to a target organ or inhaled into 
the respiratory tract to target cells of the respiratory tract. Regarding liposomes, see, e.g., 
Brighametal. Am. J. Resp. Cell Mol Biol 1:95-100 (1989); Feigner etal. Proc. Natl. 
Acad. SciUSA 84:7413-7417(1987); U.S. Pat. No.4,897,355. Furthermore, the compound 
can be administered as a component of a microcapsule that can be targeted to specific cell types, 

30 such as macrophages, or where the diffusion of the compound or delivery of the compound from 
the microcapsule is designed for a specific rate or dosage. 

226. In the methods described above which include the administration and uptake of 
exogenous DNA into the cells of a subject (i.e., gene transduction or transfection), delivery of 
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ihe compositions to cells can be via a variety of mechanisms. As one example, delivery can be 
via a liposome, using commercially available liposome preparations such as LIPOFECTIN, 
LTPOFECTAMINE (GIBCO-BRL, Inc., Gaithersburg, MD), SUPERFECT (Qiagen, Inc. 
Hilden, Germany) and TRANSFECTAM (Promega Biotec, Inc., Madison, WI), as well as other 
5 liposomes developed according to procedures standard in the art. In addition, the disclosed 
nucleic acid or vector can be delivered in vivo by electroporation, the technology for which is 
available from Genetronics, Inc. (San Diego, CA) as well as by means of a SONOPORATION 
machine (ImaRx Pharmaceutical Corp., Tucson, AZ). 

227. The materials may be in solution, suspension (for example, incorporated into 

1 0 microparticles, liposomes, or cells). These may be targeted to a particular cell type via 

antibodies, receptors, or receptor ligands. The following references are examples of the use of 
this technology to target specific proteins to tumor tissue (Senter, et al., Bioconjugate Chem. , 
2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer. 60:275-281, (1989); Bagshawe, et al., Br. J. 
Cancer, 58:700-703, (1988); Senter, et al., Bioconjugate Chem., 4:3-9, (1993); Battelli, et al., 

15 Cancer Immunol. Immunother.. 35:421-425, (1992); Pietersz and McKenzie. Immunolog. 

Reviews . 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol 42:2062-2065, (1991)). 
These techniques can be used for a variety of other speciifc cell types. Vehicles such as "stealth" 
and other antibody conjugated liposomes (including lipid mediated drug targeting to colonic 
carcinoma), receptor mediated targeting of DNA through cell specific ligands, lymphocyte 

20 directed tumor targeting, and highly specific therapeutic retroviral targeting of murine glioma 
cells in vivo. The following references are examples of the use of this technology to target 
specific proteins to tumor tissue (Hughes et al., Cancer Research. 49:6214-6220, (1989); and 
Litzinger and Huang, Biochimica et Biophvsica Acta. 1 104:179-187, (1992)). In general, 
receptors are involved in pathways of endocytosis, either constitutive or ligand induced. These 

25 receptors cluster in clathrin-coated pits, enter the cell via clathrin-coated vesicles, pass through 
an acidified endosome in which the receptors are sorted, and then either recycle to the cell 
surface, become stored intracellularly, or are degraded in lysosomes. The internalization 
pathways serve a variety of functions, such as nutrient uptake, removal of activated proteins, 
clearance of macromolecules, opportunistic entry of viruses and toxins, dissociation and 

30 degradation of ligand, and receptor-level regulation. Many receptors follow more than one 

intracellular pathway, depending on the cell type, receptor concentration, type of ligand, ligand 
valency, and ligand concentration. Molecular and cellular mechanisms of receptor-mediated 
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endocytosis has been reviewed (Brown and Greene, DNA and Cell Biology 10:6, 399-409 
(1991)). 

228. Nucleic acids that are delivered to cells which are to be integrated into the host 
cell genome, typically contain integration sequences. These sequences are often viral related 

5 sequences, particularly when viral based systems are used. These viral intergration systems can 
also be incorporated into nucleic acids which are to be delivered using a non-nucleic acid based 
system of deliver, such as a liposome, so that the nucleic acid contained in the delivery system 
can be come integrated into the host genome. 

229. Other general techniques for integration into the host genome include, for 

10 example, systems designed to promote homologous recombination with the host genome. These 
systems typically rely on sequence flanking the nucleic acid to be expressed that has enough 
homology with a target sequence within the host cell genome that recombination between the 
vector nucleic acid and the target nucleic acid takes place, causing the delivered nucleic acid to 
be integrated into the host genome. These systems and the methods necessary to promote 

15 homologous recombination are known to those of skill in the art. 

c) In vivo/ex vivo 

230. As described above, the compositions can be administered in a pharmaceutically 
acceptable carrier and can be delivered to the subject=s cells in vivo and/or ex vivo by a variety 
of mechanisms well known in the art (e.g., uptake of naked DNA, liposome fusion, 

20 intramuscular injection of DNA via a gene gun, endocytosis and the like). 

23 1 . If ex vivo methods are employed, cells or tissues can be removed and maintained 
outside the body according to standard protocols well known in the art. The compositions can be 
introduced into the cells via any gene transfer mechanism, such as, for example, calcium 
phosphate mediated gene delivery, electroporation, microinjection or proteoliposomes. The 

25 transduced cells can then be infused (e.g., in a pharmaceutically acceptable carrier) or 

homotopically transplanted back into the subject per standard methods for the cell or tissue type. 
Standard methods are known for transplantation or infusion of various cells into a subject. 
5. Peptides 

a) Protein variants 

30 232. As discussed herein there are numerous variants of the DHR96 protein that are 

known and herein contemplated. In addition, to the known functional DHR96 strain variants 
there are derivatives of the DHR96 protein which also function in the disclosed methods and 
compositions. Protein variants and derivatives are well understood to those of skill in the art and 
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in can involve amino acid sequence modifications. For example, amino acid sequence 
modifications typically fall into one or more of three classes: substitutional, insertional or 
deletional variants. Insertions include amino and/or carboxyl terminal fusions as well as 
intrasequence insertions of single or multiple amino acid residues. Insertions ordinarily will be 
5 smaller insertions than those of amino or carboxyl terminal fusions, for example, on the order of 
one to four residues. Immunogenic fusion protein derivatives, such as those described in the 
examples, are made by fusing a polypeptide sufficiently large to confer immunogenicity to the 
target sequence by cross-linking in vitro or by recombinant cell culture transformed with DNA 
encoding the fusion. Deletions are characterized by the removal of one or more amino acid 

10 residues from the protein sequence. Typically, no more than about from 2 to 6 residues are 

deleted at any one site within the protein molecule. These variants ordinarily are prepared by site 
specific mutagenesis of nucleotides in the DNA encoding the protein, thereby producing DNA 
encoding the variant, and thereafter expressing the DNA in recombinant cell culture. Techniques 
for making substitution mutations at predetermined sites in DNA having a known sequence are 

15 well known, for example M 1 3 primer mutagenesis and PCR mutagenesis. Amino acid 

substitutions are typically of single residues, but can occur at a number of different locations at 
once; insertions usually will be on the order of about from 1 to 10 amino acid residues; and 
deletions will range about from 1 to 30 residues. Deletions or insertions preferably are made in 
adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues. Substitutions, deletions, 

20 insertions or any combination thereof may be combined to arrive at a final construct. The 
mutations must not place the sequence out of reading frame and preferably will not create 
complementary regions that could produce secondary mRNA structure. Substitutional variants 
are those in which at least one residue has been removed and a different residue inserted in its 
place. Such substitutions generally are made in accordance with the following Tables 1 and 2 

25 and are referred to as conservative substitutions. 

233. TABLE 1 : Amino Acid Abbreviations 



Amino Acid 


Abbreviations 


alanine 


AlaA 


allosoleucine 


Alle 


arginine 


ArgR 


asparagine 


AsnN 


aspartic acid 


AspD 


cysteine 


CysC 


glutamic acid 


GIuE 


glutamine 


GInK 


glycine 


GlyG 


hisridine 


HisH 


isolelucine 


lie! 
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Amino Acid 


Abb re vial ions 


It 1 llv 


LeuL 


lysine 


LysK 


phenylalanine 


PheF 


proline 


ProP 


pyroglutamic acidp 


Glu 


serine 


SerS 


threonine 


ThrT 


tyrosine 


TyrY 


tryptophan 


TrpW 


valine 


ValV 



TABLE 2:Amino Acid Substitutions 

Original Residue Exemplary Conservative Substitutions, others are known in the art. 

A laser 

Ar g 1v$ » S ln 

Asngln; his 

Aspgltt 

Cysser 

Gjnasn, lys 

Gluasp 

Glv P ro 

Hisasn;gln 

lleleu; val 

Leuile; val 

Lysarg; gin; 

MetLeu; ile 

Phemet; leu; tyr 

Serthr 

Thrser 

Trptyr 

Tyrtrp; phe 

Valile; leu ; 

234. Substantial changes in function or immunological identity are made by selecting 

substitutions that are less conservative than those in Table 2, i.e., selecting residues that differ 

more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in 

5 the area of the substitution, for example as a sheet or helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The 
substitutions which in general are expected to produce the greatest changes in the protein 
properties will be those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted 
for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a 

10 cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., 
phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this case, (e) 
by increasing the number of sites for sulfation and/or glycosylation. 
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235. For example, the replacement of one amino acid residue with another that is 
biologically and/or chemically similar is known to those skilled in the art as a conservative 
substitution. For example, a conservative substitution would be replacing one hydrophobic 
residue for another, or one polar residue for another. The substitutions include combinations 

5 such as, for example, Gly, Ala; Val, lie, Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, 
Tyr. Such conservatively substituted variations of each explicitly disclosed sequence are 
included within the mosaic polypeptides provided herein. 

236. Substitutional or deletional mutagenesis can be employed to insert sites for N- 
glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other 

10 labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, 
e.g. Arg, is accomplished for example by deleting one of the basic residues or substituting one 
by glutaminyl or histidyl residues. 

237. Certain post-translational derivatizations are the result of the action of 
recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are 

1 5 frequently post-translationally deamidated to the corresponding glutamyl and asparyl residues. 
Alternatively, these residues are deamidated under mildly acidic conditions. Other post- 
translational modifications include hydroxylation of proline and lysine, phosphorylation of 
hydroxyl groups of seryl or threonyl residues, methylation of the o-amino groups of lysine, 
arginine, and histidine side chains (T.E. Creighton, Proteins: Structure and Molecular 

20 Properties, W. H. Freeman & Co., San Francisco pp 79-86 [1983]), acetylation of the N- 
terminal amine and, in some instances, arnidation of the C-terminal carboxyl. 

238. It is understood that one way to define the variants and derivatives of the 
disclosed proteins herein is through defining the variants and derivatives in terms of 
homology/identity to specific known sequences. For example, SEQ ID NO: 8 sets forth a 

25 particular sequence of DHR96 cDNA and SEQ ID NO:7 sets forth a particular sequence of a 
DHR96 protein. Specifically disclosed are variants of these and other proteins herein disclosed 
which have at least, 70% or 75% or 80% or 85% or 90% or 95% homology to the stated 
sequence. Those of skill in the art readily understand how to determine the homology of two 
proteins. For example, the homology can be calculated after aligning the two sequences so that 

30 the homology is at its highest level. 

239. Another way of calculating homology can be performed by published algorithms. 
Optimal alignment of sequences for comparison may be conducted by the local homology 
algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology 
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alignment algorithm of Needleman and Wunsch, J. MoL Bio!. 48: 443 (1970), by the search for 
similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (198S), by 
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in 
the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 

5 Madison, WI), or by inspection. 

240. The same types of homology can be obtained for nucleic acids by for example the 
algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger etal. Proc. Natl. Acad. 
Sci. USA 86:7706-7710, 1989, Jaeger etal. Methods Enzymol 183:281-306, 1989 which are 
herein incorporated by reference for at least material related to nucleic acid alignment. 

10 24 1 . It is understood that the description of conservative mutations and homology can 

be combined together in any combination, such as embodiments that have at least 70% 
homology to a particular sequence wherein the variants are conservative mutations. 

242. As this specification discusses various proteins and protein sequences it is 
understood that the nucleic acids that can encode those protein sequences are also disclosed. 

15 This would include all degenerate sequences related to a specific protein sequence, i.e. all 
nucleic acids having a sequence that encodes one particular protein sequence as well as all 
nucleic acids, including degenerate nucleic acids, encoding the disclosed variants and derivatives 
of the protein sequences. Thus, while each particular nucleic acid sequence may not be written 
out herein, it is understood that each and every sequence is in fact disclosed and described herein 

20 through the disclosed protein sequence. For example, one of the many nucleic acid sequences 
that can encode the protein sequence set forth in SEQ ID NO:7 is set forth in SEQ ID NO: 8. It is 
also understood that while no amino acid sequence indicates what particular DNA sequence 
encodes that protein within an organism, where particular variants of a disclosed protein are 
disclosed herein, the known nucleic acid sequence that encodes that protein in the particular 

25 organism from which that protein arises is also known and herein disclosed and described. 

243. It is understood that there are numerous amino acid and peptide analogs which 
can be incorporated into the disclosed compositions. For example, there are numerous D amino 
acids or amino acids which have a different functional substituent then the amino acids shown in 
Table 1 and Table 2. The opposite stereo isomers of naturally occurring peptides are disclosed, 

30 as well as the stereo isomers of peptide analogs. These amino acids can readily be incorporated 
into polypeptide chains by charging tRNA molecules with the amino acid of choice and 
engineering genetic constructs that utilize, for example, amber codons, to insert the analog amino 
acid into a peptide chain in a site specific way (Thorson et al., Methods in Molec. Biol. 77:43- 
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73 (1991), Zoller, Current Opinion in Biotechnology, 3:348-354 (1992); Ibba, Biotechnology & 
Genetic Enginemng Reviews 13:197-216 (1995), Cahill et al., TIBS, 14(10):400-403 (1989); 
Benner, TIB Tech, 12:158-163 (1994); Ibba and Hennecke, Bio/technology, 12:678-682 (1994) 
all of which are herein incorporated by reference at least for material related to amino acid 
5 analogs). 

244. Molecules can be produced that resemble peptides, but which are not connected 
via a natural peptide linkage. For example, linkages for amino acids or amino acid analogs can 
include CH 2 NH— , -CH 2 S~, --CH 2 -CH 2 --, »CH=CH« (cis and trans), -COCH 2 --, - 
CH(OH)CH2— , and — CHH 2 SO — (These and others can be found in Spatola, A. F. in Chemistry 

10 and Biochemistry of Amino Acids, Peptides, and Proteins, B. Weinstein, eds., Marcel Dekker, 
New York, p. 267 (1 983); Spatola, A. F., Vega Data (March 1 983), Vol. 1 , Issue 3, Peptide 
Backbone Modifications (general review); Morley, Trends Phann Sci (1980) pp. 463-468; 
Hudson, D. et al., Int J Pept Prot Res 14:177-185 (1979) (~CH 2 NH~, CH 2 CH 2 ~); Spatola et al. 
Life Sci 38:1243-1249 (1986) (-CH H 2 ~S); Hann J. Chem. Soc Perkin Trans. 1307-314 
15 (1982) (~CH~CH~, cis and trans); Almquist et al. J. Med. Chem. 23:1392-1398 (1980) (- 
COCH 2 -); Jennings- White et al. Tetrahedron Lett 23:2533 (1982) (~COCH 2 ~); Szelke et al. 
European Appln, EP 45665 CA (1982): 97:39405 (1982) (-CH(OH)CH 2 -); Holladay et al. 
Tetrahedron. Lett 24:4401-4404 (1983)(--C(OH)CH 2 ~); and Hruby Life Sci 31:189-199 (1982) 
(~CH 2 "S~); each of which is incorporated herein by reference. A particularly preferred non- 
20 peptide linkage is -CH 2 NH— . It is understood that peptide analogs can have more than one 
atom between the bond atoms, such as b-alanine, g-aminobutyric acid, and the like. 

245. Amino acid analogs and analogs and peptide analogs often have enhanced or 
desirable properties, such as, more economical production, greater chemical stability, enhanced 
pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered specificity (e.g., 

25 a broad-spectrum of biological activities), reduced antigenicity, and others. 

246. D-amino acids can be used to generate more stable peptides, because D amino 
acids are not recognized by peptidases and such. Systematic substitution of one or more amino 
acids of a consensus sequence with a D-amino acid of the same type (e.g., D-lysine in place of L- 
lysine) can be used to generate more stable peptides. Cysteine residues can be used to cyclize or 

30 attach two or more peptides together. This can be beneficial to constrain peptides into particular 
conformations. (Rizo and Gierasch Ann. Rev. Biochem. 61:387 (1992), incorporated herein by 
reference). 

6. Pharmaceutical carriers/Delivery of pharamceutical products 
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247. As described above, the compositions can also be administered in vivo in a 
pharmaceutically acceptable carrier. By "pharmaceutically acceptable" is meant a material that is 
not biologically or otherwise undesirable, i.e., the material may be administered to a subject, 
along with the nucleic acid or vector, without causing any undesirable biological effects or 

5 interacting in a deleterious manner with any of the other components of the pharmaceutical 
composition in which it is contained. The carrier would naturally be selected to minimize any 
degradation of the active ingredient and to minimize any adverse side effects in the subject, as 
would be well known to one of skill in the art. 

248 . The compositions may be administered orally, parenterally (e.g., intravenously), 
10 by intramuscular injection, by intraperitoneal injection, transdermally, extracorporeally, topically 

or the like, including topical intranasal administration or administration by inhalant. As used 
herein, "topical intranasal administration" means delivery of the compositions into the nose and 
nasal passages through one or both of the nares and can comprise delivery by a spraying 
mechanism or droplet mechanism, or through aerosolization of the nucleic acid or vector. 

15 Administration of the compositions by inhalant can be through the nose or mouth via delivery by 
a spraying or droplet mechanism. Delivery can also be directly to any area of the respiratory 
system (e.g., lungs) via intubation. The exact amount of the compositions required will vary 
from subject to subject, depending on the species, age, weight and general condition of the 
subject, the severity of the allergic disorder being treated, the particular nucleic acid or vector 

20 used, its mode of administration and the like. Thus, it is not possible to specify an exact amount 
for every composition. However, an appropriate amount can be determined by one of ordinary 
skill in the art using only routine experimentation given the teachings herein. 

249. Parenteral administration of the composition, if used, is generally characterized by 
injection. Injectables can be prepared in conventional forms, either as liquid solutions or 

25 suspensions, solid forms suitable for solution of suspension in liquid prior to injection, or as 
emulsions. A more recently revised approach for parenteral administration involves use of a 
slow release or sustained release system such that a constant dosage is maintained. See, e.g., 
U.S. Patent No. 3,610,795, which is incorporated by reference herein. 

250. The materials may be in solution, suspension (for example, incorporated into 
30 microparticles, liposomes, or cells). These may be targeted to a particular cell type via 

antibodies, receptors, or receptor ligands. The following references are examples of the use of 
this technology to target specific proteins to tumor tissue (Senter, et al., Bioconiugate Chem. . 
2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, et al., Br. J. 
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Cancer. 58:700-703, (19S8); Senter, et al., Bioconiugate Chem. . 4:3-9, (1993); Battelli, et al., 
Cancer Immunol. Immunother.. 35:421-425, (1992); Pietersz and McKenzie. Immunolog. 
Reviews . 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol. 42:2062-2065, (1991)). 
Vehicles such as "stealth" and other antibody conjugated liposomes (including lipid mediated 
5 drug targeting to colonic carcinoma), receptor mediated targeting of DNA through cell specific 
ligands, lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting 
of murine glioma cells in vivo. The following references are examples of the use of this 
technology to target specific proteins to tumor tissue (Hughes et al., Cancer Research, 49:6214- 
6220, (1989); and Litzinger and Huang. Biochimica et Biophvsica Acta . 1 104:179-187, (1992)). 

10 In general, receptors are involved in pathways of endocytosis, either constitutive or ligand 
induced. These receptors cluster in clathrin-coated pits, enter the cell via clathrin-coated 
vesicles, pass through an acidified endosome in which the receptors are sorted, and then either 
recycle to the cell surface, become stored intracellularly, or are degraded in lysosomes. The 
internalization pathways serve a variety of functions, such as nutrient uptake, removal of 

1 5 activated proteins, clearance of macromolecules, opportunistic entry of viruses and toxins, 
dissociation and degradation of ligand, and receptor-level regulation. Many receptors follow 
more than one intracellular pathway, depending on the cell type, receptor concentration, type of 
ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms of 
receptor-mediated endocytosis has been reviewed (Brown and Greene, DNA and Cell Biology 

20 10:6,399-409(1991)). 

a) Pharmaceutically Acceptable Carriers 

251 . The compositions, including antibodies, can be used therapeutically in 
combination with a pharmaceutically acceptable carrier, 

252. Suitable carriers and their formulations are described in Remington: The Science 
25 and Practice of Pharmacy ( 1 9th ed.) ed. A.R. Gennaro, Mack Publishing Company, Easton, PA 

1995. Typically, an appropriate amount of a pharmaceutically-acceptable salt is used in the 
formulation to render the formulation isotonic. Examples of the pharmaceutically-acceptable 
carrier include, but are not limited to, saline, Ringer's solution and dextrose solution. The pH of 
the solution is preferably from about 5 to about 8, and more preferably from about 7 to about 7.5. 
30 Further carriers include sustained release preparations such as semipermeable matrices of solid 
hydrophobic polymers containing the antibody, which matrices are in the form of shaped articles, 
e.g., films, liposomes or microparticles. It will be apparent to those persons skilled in the art that 
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certain carriers may be more preferable depending upon, for instance, the route of administration 
and concentration of composition being administered. 

253. Pharmaceutical carriers are known to those skilled in the art. These most 
typically would be standard carriers for administration of drugs to humans, including solutions 

5 such as sterile water, saline, and buffered solutions at physiological pH. The compositions can 
be administered intramuscularly or subcutaneously. Other compounds will be administered 
according to standard procedures used by those skilled in the art. 

254. Pharmaceutical compositions may include carriers, thickeners, diluents, buffers, 
preservatives, surface active agents and the like in addition to the molecule of choice. 

1 0 Pharmaceutical compositions may also include one or more active ingredients such as antimicrobial 
agents, antiinflammatory agents, anesthetics, and the like. 

255. The pharmaceutical composition may be administered in a number of ways 
depending on whether local or systemic treatment is desired, and on the area to be treated. 
Administration may be topically (including ophthalmically, vaginally, rectally, intranasally), orally, 

1 5 by inhalation, or parenterally, for example by intravenous drip, subcutaneous, intraperitoneal or 
intramuscular injection. The disclosed antibodies can be administered intravenously, 
intraperitoneally, intramuscularly, subcutaneously, intracavity, or transdermally. 

256. Preparations for parenteral administration include sterile aqueous or non-aqueous 
solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 

20 polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, 
including saline and buffered media. Parenteral vehicles include sodium chloride solution, 
Ringer's dextrose, dextrose and sodium chloride, lactated Ringers, or fixed oils. Intravenous 
vehicles include fluid and nutrient replenishes, electrolyte replenishes (such as those based on 

25 Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, 
for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

257. Formulations for topical administration may include ointments, lotions, creams, 
gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, 
aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. 

30 258. Compositions for oral administration include powders or granules, suspensions or 

solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings, 
diluents, emulsifiers, dispersing aids or binders may be desirable.. 
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259, Some of the compositions may potentially be administered as a pharmaceutically 
acceptable acid- or base- addition salt, formed by reaction with inorganic acids such as 
hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, 
and phosphoric acid, and organic acids such as formic acid, acetic acid, propionic acid, glycolic 
5 acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fumaric 
acid, or by reaction with an inorganic base such as sodium hydroxide, ammonium hydroxide, 
potassium hydroxide, and organic bases such as mono-, di-, trialky] and aryl amines and 
substituted ethanolamines. 

b) Therapeutic Uses 

10 260. ' Effective dosages and schedules for administering the compositions may be 

determined empirically, and making such determinations is within the skill in the art. The 
dosage ranges for the administration of the compositions are those large enough to produce the 
desired effect in which the symptoms disorder are effected. The dosage should not be so large as 
to cause adverse side effects, such as unwanted cross-reactions, anaphylactic reactions, and the 

1 5 like. Generally, the dosage will vary with the age, condition, sex and extent of the disease in the 
patient, route of administration, or whether other drugs are included in the regimen, and can be 
determined by one of skill in the art. The dosage can be adjusted by the individual physician in 
the event of any counterindi cations. Dosage can vary, and can be administered in one or more 
dose administrations daily, for one or several days. Guidance can be found in the literature for 

20 appropriate dosages for given classes of pharmaceutical products. For example, guidance in 
selecting appropriate doses for antibodies can be found in the literature on therapeutic uses of 
antibodies, e.g., Handbook of Monoclonal Antibodies, Ferrone et al., eds., Noges Publications, 
Park Ridge, N.J., (1985) ch. 22 and pp. 303-357; Smith et al., Antibodies in Human Diagnosis 
and Therapy, Haber et al., eds., Raven Press, New York (1977) pp. 365-389. A typical daily 

25 dosage of the antibody used alone might range from about 1 ug/kg to up to 100 mg/kg of body 
weight or more per day, depending on the factors mentioned above. 
7* Chips and micro arrays 

261 . Disclosed are chips where at least one address is the sequences or part of the 
sequences set forth in any of the nucleic acid sequences disclosed herein. Also disclosed are 

30 chips where at least one address is the sequences or portion of sequences set forth in any of the 
peptide sequences disclosed herein. 

262. Also disclosed are chips where at least one address is a variant of the sequences 
or part of the sequences set forth in any of the nucleic acid sequences disclosed herein. Also 
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disclosed are chips where at least one address is a variant of the sequences or portion of 
sequences set forth in any of the peptide sequences disclosed herein. 

8. Computer readable mediums 

263. It is understood that the disclosed nucleic acids and proteins can be represented as 
5 a sequence consisting of the nucleotides of amino acids. There are a variety of ways to display 
these sequences, for example the nucleotide guanosine can be represented by G or g. Likewise 
the amino acid valine can be represented by Val or V. Those of skill in the art understand how to 
display and express any nucleic acid or protein sequence in any of the variety of ways that exist, 
each of which is considered herein disclosed. Specifically contemplated herein is the display of 

10 these sequences on computer readable mediums, such as, commercially available floppy disks, 
tapes, chips, hard drives, compact disks, and video disks, or other computer readable mediums. 
Also disclosed are the binary code representations of the disclosed sequences. Those of skill in 
the art understand what computer readable mediums. Thus, computer readable mediums on 
which the nucleic acids or protein sequences are recorded, stored, or saved. 

1 5 264. Disclosed are computer readable mediums comprising the sequences and 

information regarding the sequences set forth herein. Also disclosed are computer readable 
mediums comprising the sequences and information regarding the sequences set forth herein 
wherein the sequences do not include SEQ ID Nos: 37, 38, 39, 40, 41 , and 42. 

9. Kits 

20 265. Disclosed herein are kits that are drawn to reagents that can be used in practicing 

the methods disclosed herein. The kits can include any reagent or combination of reagent 
discussed herein or that would be understood to be required or beneficial in the practice of the 
disclosed methods. For example, the kits could include primers to perform the amplification 
reactions discussed in certain embodiments of the methods, as well as the buffers and enzymes 

25 required to use the primers as intended. 

D. Methods of making the compositions 

266. The compositions disclosed herein and the compositions necessary to perform the 
disclosed methods can be made using any method known to those of skill in the art for that 
particular reagent or compound unless otherwise specifically noted. 

30 1. Nucleic acid synthesis 

267. For example, the nucleic acids, such as, the oligonucleotides to be used as primers 
can be made using standard chemical synthesis methods or can be produced using enzymatic 
methods or any other known method. Such methods can range from standard enzymatic 
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digestion followed by nucleotide fragment isolation (see for example, Sambrook et aL, 
Molecular Cloning: A Laboratory Manual, 2nd Edition (Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., 1989) Chapters 5, 6) to purely synthetic methods, for example, by the 
cyanoethyl phosphoramidite method using a Milligen or Beckman System lPlus DNA 
5 synthesizer (for example, Model 8700 automated synthesizer of Milligen-Biosearch, Burlington, 
MA or ABI Model 380B). Synthetic methods useful for making oligonucleotides are also 
described by Ikuta etai, Ann. Rev. Biochem. 53:323-356 (1984), (phosphotriester and 
phosphite-triester methods), and Narang et al. 9 Methods EnzymoL, 65:610-620 (1980), 
(phosphotriester method). Protein nucleic acid molecules can be made using known methods 

10 such as those described by Nielsen et a/., Bioconjug. Chem. 5:3-7 (1 994). 
2. Peptide synthesis 
268. One method of producing the disclosed proteins, such as SEQ ID NO:23, is to 
link two or more peptides or polypeptides together by protein chemistry techniques. For 
example, peptides or polypeptides can be chemically synthesized using currently available 

15 laboratory equipment using either Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert 

-butyloxycarbonoyl) chemistry, (Applied Biosystems, Inc., Foster City, CA). One skilled in the 
art can readily appreciate that a peptide or polypeptide corresponding to the disclosed proteins, 
for example, can be synthesized by standard chemical reactions. For example, a peptide or 
polypeptide can be synthesized and not cleaved from its synthesis resin whereas the other 

20 fragment of a peptide or protein can be synthesized and subsequently cleaved from the resin, 
thereby exposing a terminal group which is functionally blocked on the other fragment. By 
peptide condensation reactions, these two fragments can be covalently joined via a peptide bond 
at their carboxyl and amino termini, respectively, to form an antibody, or fragment thereof. 
(Grant GA (1992) Synthetic Peptides: A User Guide. W.H. Freeman and Co., N.Y. (1992); 

25 Bodansky M and Trost B., Ed. (1993) Principles of Peptide Synthesis. Springer- Verlag Inc., NY 
(which is herein incorporated by reference at least for material related to peptide synthesis). 
Alternatively, the peptide or polypeptide is independently synthesized in vivo as described 
herein. Once isolated, these independent peptides or polypeptides may be linked to form a 
peptide or fragment thereof via similar peptide condensation reactions. 

30 269. For example, enzymatic ligation of cloned or synthetic peptide segments allow 

relatively short peptide fragments to be joined to produce larger peptide fragments, polypeptides 
or whole protein domains (Abrahmsen L et al., Biochemistry, 30:41 51 (1991)). Alternatively, 
native chemical ligation of synthetic peptides can be utilized to synthetically construct large 
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peptides or polypeptides from shorter peptide fragments. This method consists of a two step 
chemical reaction (Dawson et al. Synthesis of Proteins by Native Chemical Ligation. Science, 
266:776-779 (1994)). The first step is the chemoselective reaction of an unprotected synthetic 
peptide— thioester with another unprotected peptide segment containing an amino-terminal Cys 
5 residue to give a thioester-linked intermediate as the initial covalent product. Without a change 
in the reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular 
reaction to form a native peptide bond at the ligation site (Baggiolini M et al. (1992) FEBS Lett. 
307:97-101; Clark-Lewis I et al., J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al., 
Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry 33:6623-30 (1994)). 

10 270. Alternatively, unprotected peptide segments are chemically linked where the bond 

formed between the peptide segments as a result of the chemical ligation is an unnatural 
(non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)). This technique has been 
used to synthesize analogs of protein domains as well as large amounts of relatively pure 
proteins with full biological activity (deLisle Milton RC et al., Techniques in Protein Chemistry 

15 rV. Academic Press, New York, pp. 257-267(1992)). 

3. Processes for making the compositions 

27 1 . Disclosed are processes for making the compositions as well as making the 
intermediates leading to the compositions. For example, disclosed are nucleic acids and proteins 
in SEQ ID NOs: 1-60. There are a variety of methods that can be used for making these 

20 compositions, such as synthetic chemical methods and standard molecular biology methods. It is 
understood that the methods of making these and the other disclosed compositions are 
specifically disclosed. 

272. Disclosed are nucleic acid molecules produced by the process comprising linking 
in an operative way a nucleic acid comprising the sequence set forth herein and a sequence 

25 controlling the expression of the nucleic acid. 

273. Also disclosed are nucleic acid molecules produced by the process comprising 
linking in an operative way a nucleic acid molecule comprising a sequence having 80% identity 
to a sequence set forth in herein, and a sequence controlling the expression of the nucleic acid. 

274. Disclosed are nucleic acid molecules produced by the process comprising linking 
30 in an operative way a nucleic acid molecule comprising a sequence that hybridizes under 

stringent hybridization conditions to a sequence set forth herein and a sequence controlling the 
expression of the nucleic acid. 



— 75 — 



Attorney Docket Number 21 1OI .0053UI 

275. Disclosed are nucleic acid molecules produced by the process comprising linking 
in an operative way a nucleic acid molecule comprising a sequence encoding a peptide set forth 
in SEQ ID NO:7 and a sequence controlling an expression of the nucleic acid molecule, 

276. Disclosed are nucleic acid molecules produced by the process comprising linking 
5 in an operative way a nucleic acid molecule comprising a sequence encoding a peptide having 

80% identity to a peptide set forth in herein and a sequence controlling an expression of the 
nucleic acid molecule. 

277. Disclosed are nucleic acids produced by the process comprising linking in an 
operative way a nucleic acid molecule comprising a sequence encoding a peptide having 80% 

10 identity to a peptide set forth in herein, wherein any change from the herein are conservative 
changes and a sequence controlling an expression of the nucleic acid molecule. 

278. Disclosed are cells produced by the process of transforming the cell with any of 
the disclosed nucleic acids. Disclosed are cells produced by the process of transforming the cell 
with any of the non-naturally occurring disclosed nucleic acids. 

1 5 279. Disclosed are any of the disclosed peptides produced by the process of expressing 

any of the disclosed nucleic acids. Disclosed are any of the non-naturally occurring disclosed 
peptides produced by the process of expressing any of the disclosed nucleic acids. Disclosed are 
any of the disclosed peptides produced by the process of expressing any of the non-naturally 
disclosed nucleic acids. 

20 280. Disclosed are animals and invertebrates produced by the process of transfecting a 

cell within the animal or invertebrate with any of the nucleic acid molecules disclosed herein. 
Disclosed are animals or invertebrates produced by the process of transfecting a cell within the 
animal any of the nucleic acid molecules disclosed herein, wherein the animal is a mammal 
invertebrate is an insect, such as drosophila. Also disclosed are animals produced by the process 

25 of transfecting a cell within the animal any of the nucleic acid molecules disclosed herein, 
wherein the mammal is mouse, rat, rabbit, cow, sheep, pig, or primate. 

281 . Also disclose are animals produced by the process of adding to the animal any of 
the cells disclosed herein. 

E. Methods of using the compositions 
30 1. Methods of using the compositions as research tools 

282. The disclosed compositions can be used in a variety of ways as research tools. 
For example, the disclosed compositions, such as molecules disclosed herein can be used to 
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study the interactions between the molecules, and for example, their ligands or other compounds, 
by for example acting as inhibitors of binding. 

283. The compositions can be used for example as targets in combinatorial chemistry 
protocols or other screening protocols to isolate molecules that possess desired functional 

5 properties related to inhibiting DHR96 activity, for example. 

284. The disclosed compositions can be used as discussed herein as either reagents in 
micro arrays or as reagents to probe or analyze existing microarrays. The disclosed compositions 
can be used in any known method for isolating or identifying single nucleotide polymorphisms. 
The compositions can also be used in any method for determining allelic analysis of for example, 

10 DHR96, particularly allelic analysis as it relates to xenobiotic pathway functions. The 

compositions can also be used in any known method of screening assays, related to chip/micro 
arrays. The compositions can also be used in any known way of using the computer readable 
embodiments of the disclosed compositions, for example, to study relatedness or to perform 
molecular modeling analysis related to the disclosed compositions. 

15 F. Examples 

285. The following examples are put forth so as to provide those of ordinary skill in 
the art with a complete disclosure and description of how the compounds, compositions, articles, 
devices and/or methods claimed herein are made and evaluated, and are intended to be purely 
exemplary and are not intended to limit the disclosure. Efforts have been made to ensure 

20 accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and 
deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, 
temperature is in °C or is at ambient temperature, and pressure is at or near atmospheric. 

1. Example 1 The DHR96 nuclear receptor is required for xenobiotic 
responses in Drosophila 
25 a) Materials and Methods 

(1) Construction of the DHR96 targeting fragment 

286. A 7.55 kb DNA fragment that contains a mutated version of the Drosophila 
melanogaster DHR96 gene was generated by introducing two deletions: (1) deleting sequences 
harboring the start site (26 bp) and (2) deleting the fourth exon and intron (331 bp) from the wild 

30 type sequence. In addition, a recognition site for the restriction enzyme I-Sce I was inserted into 
the center (cuts between position 3699 and 3700) of the 7.55 kb fragment (see fig. Ml). To 
obtain a genomic clone DNA of the PI clone 26-95 that harbored the complete DHR96 gene was 
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isolated (provided by BDGP: http://www.fruitfly.org/). The assembly of the 7.55 kb targeting 
sequence was achieved by fusing three fragments: 

(a) Fragment 1 A 1.958 kb Apa I-Hind III fragment 

287. This was isolated by cutting P 1 26-95 with Hind 01 and isolating a 6.599 kb Hind 
5 EH fragment, which then was cut with Apa I and Sgr AI. The 1 .958 kb Apa I - Hind III fragment 

was cloned into Litmus 38 (New England BioLabs) (cut with Apa I and Hind III). 

(b) Fragment 2 A 4.325 kb fragment 

288. This fragment contains the actual mutations and forms the core of the targeting 
construct. It was generated by using three pairs of PCR primers (for sequences, see oligos): (I) 

10 FAPA96 and R96EX3Sce, (II) F96Int3Sce and R96Int3, (HI) F96Ex5Int3 and R96EndHind. The 
PI 26-95 genomic clone served as a template. Primer pair (I) produced a 1724 bp fragment, 
primer pair (H) a 993 bp fragment and primer pair (III) a 1 650 bp fragment. The 993 bp and the 
1650 bp fragments were fused in a PCR reaction using the primers F96Int3Sce and R96EndHind, 
generating a 2.62 kb fragment. Likewise, the 1724 bp and the 993 bp fragments were fused using 

1 5 the FAPA96 and R96Int3 primers to form a 2.70 kb fragment. In a final step, the 2.70 and the 
2.62 kb fragments were fused using the primers FAPA96 and R96EndHind to form the 
aforementioned 4.325 kb fragment, which was cloned into PCR TOPO 2.1 (Invitrogen). 

(c) Fragment 3 A 1.86 kb PCR fragment 

289. Fragment 3 was generated using the primers F96Xma and R96SpeBgl, with the 
20 PI 26-95 clone as a template. The fragment was eluted and cut directly with Xma I and Spe I. 

290. The 1 .86 kb PCR fragment was cloned into the PCR Topo 2.1 vector (Invitrogen) 
containing the 4.325 kb, which was cut with Xma I and Spe I. The resulting clone was cut with 
Apa I and Spe I and fused to the 1.958 kb fragment, which had been previously isolated from 
Litmus 38 (New England Biolabs) with Apa I and Spe I. The resulting clone is the 7.55 kb 

25 targeting fragment. A sequence printout and annotation of this fragment is included (SEQ ID 
NO:37). 

(2) Construction of the hs-Ga!4-DHR96 fusion gene 

291. A fusion of the Gal4 DNA binding domain (amino acids 1 to 147) and the 
DHR96 hinge region and ligand binding domain (LBD) (amino acids 99 to 723) was generated 

30 to create a Gal4-LBD fusion protein. Two PCR fragments were generated: (I) a 475 bp fragment 
using the primers FGALXB and RGAL96 and a Gal4 containing plasmid as a template. (II) 
F96BEG and R96/936 generate a 372 bp fragment from pLF20N, which contains the DHR96 
cDNA (Fisk and Thummel, 1995). Fragments (I) and (H) possess a 15 bp overlap that was then 
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utilized to fuse them by PCR. The resulting 832 bp fragment was cut with Xba I and Age I and 
cloned into pLF20N, which had been cut with the same enzymes to remove the DHR96 DNA- 
binding domain. The resulting plasmid is termed pGAL96 . To obtain the final transformation 
vector, the Gal4-DHR96 fusion gene was isolated from pGAL96 with Not I and Nhe I and 
5 ligated to pCASPER hs-act cut with Xba I and Not I (SEQ ID NO:38, (see Seq 2 for the 
sequence of the insert in this vector, encoding the Gal4-LBD fusion). 

(3) Construction of the hs-DHR96 RNAi vector 

292. An inverted repeat sequence that corresponds to a part of the coding region for the 
DHR96 ligand-binding domain (each repeat corresponds to nucleotides 1444-2371 of the 

10 DHR96 plasmid pLF20N; Fisk and Thummel, 1995) was generated. The repeats are separated 
by a unique spacer region of 101 bp that corresponds to nucleotides 2372-2472 of the same 
DHR96 cDNA. Two primer pairs were used: (I) F96Xbai and R96BspEl and (II) F96Xbai and 
R96BspE2. Both fragments were cut with Bsp EI and ligated. The ligated fragment was purified 
and cut with Xba I and cloned into Litmus 28 (New England Biolabs) cut with Xba I. After the 

15 cloned fragment (1956 bp) was verified by restriction analysis, it was excised with Xba I and f 
inserted into pCasper hs-act cut with Xba I. 

(4) Construction of the hs-DHR96 vector and fly 
transformation 

293. This vector produces wild type DHR96 protein under the control of an hsp70 
20 promoter in a transgenic animal. A full length cDNA was excised from the plasmid pLF20N 

with the restriction enzymes Not I and Nhel and cloned it into pCasper hs-act vector cut with Not 
I and Xba I. Transformant flies were isolated using standard methods (Rubin GM, Spradling 
AC. Genetic transformation ofDrosophila with transposable element vectors. Science. 1982 
Oct 22;21 8(4570):348-53). 
25 (5) Construction of pET24c-DHR96 

294. To generate antibodies, DHR96 antigen was produced from a 1 .8 kb EcoRV 
fragment (597 amino acids), which includes most of the cDNA, but excludes the DNA binding 
domain. The 1 .8 kb Eco RV fragment was isolated from pLF20, a plasmid that contains a full 
length DHR96 cDNA (pLF20 differs from pLF20N in the following: pLF20 was cut with 

30 Hindffl, filled in, and religated to create a unique Nhe I site. The new plasmid was termed 

pLF20N). pET24c (Novagen) was cut with Bam HI and Xho I and blunt ends were generated by 
fill-in, and subsequently the Eco RV fragment was cloned into this vector. Orientation was tested 
using restriction analysis. A sequence printout of this clone is included (SEQ ID NO:39Seq. 3). 
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(6) Construction of pMAL-DHR96 

295. To purify antisera, soluble DHR96 protein was produced by fusing the original 
antigen to the Maltose-binding protein. To subclone the Eco RV fragment of DHR96 (the 
original antigen coding section) into pMAL-c2X (New England Biolab), a fragment from 
5 pET24c-DHR96 was PCR amplified by using the primer pair F96ANhe and R96AHind. The 
fragment was cut directly with Nhe I and HindlH and cloned into pMAL-c2X cut with Xba I and 
HindlD. 

(7) Oligonucleotides 



Oligonucleotides 



SEQ ID 
NO:40 


F96Xma 


S'-GAGAGATGTGCTTCGTTAAAGCATCAACCC 


SEQ ID 
NO;41 


R96SpeBgl 


5-GGACTAGTAGATCTAGAGGATTCTACAAATGTCCAGTGTCTCCC 


SEQ ID 
NO:42 


R96Int3 


5-CCATTATTATCGCCATAATCGTAAAGG 


SEQ ID 
NO:43 


R96EX3SCE 


S'-ATTACCCTGTTATCCCTAGCGGGTTACCTTAATGCGATCATCGCCC 


SEQ ID 

NO:44 


R96endhind 


5M3GAAAGCTTTTCCTGCTGATCAATAATACC 


SEQ ID 
NO:45 


FA PA 96 


5'-TGGGCCCATCACTTGCTTGTAACCGCCGAAGAACTGCGCGG 


SEQ ID 
NO:46 


F96INT3SCE 


5' 

CGCTAGGGATAACAGGGTAATAACAGTCCACGGTATTAGCCTATAGG 


SEQ ID 
NO:47 


F96EX5Int3 


5* 

CGATTATGGCGATAATAATGGCCAAAGAGAACATGGGCAACATACGC 


SEQ ID 
NO:48 


FGALXB 


5'-GAAGCAAGCCTCTAGAAAGATGAAGC 


SEQ ID 
NO:49 


RGAL96 


S^GTGCCGTTCrCCATCGATACAGTCAACTGTCTTTGACC 


SEQ ID 

NO:50 


R96/936 


S'-GCCTGGATAGTCGATCAAATGCG 


SEQ ID 
NO:51 


F96BEG 


5-ATGGAGAACGGCACGGATGC 


SEQ ID 

NO:52 


F96XBAi 


5-TACATTCTAGAGACCAACTACAACGACGAGCCCAGTCTGG 


SEQ ID 
NO:53 


R96BspEl 


5'^ATTCATCCGGACATTAATTATGAACTTGTTCAGACGCTCC 


SEQ ID 
NO:54 


R96BspE2 


S'-GGGCATCAACTCCGGAATTAAATGCCCGACACGCATCGG 


SEQ ID 
NO:55 


RPAXCRE-AN 


S'^TCTCACGACGTTTTGAACCCAGAAATCGAGCTCGCCCGGGG 


SEQ ID 
NO:56 


RPAXCRECO 


S'-CACGAATTCCAAACTGTCTCACGACGTTTTGAACCC 


SEQ ID 
NO:57 


FPAXFSE-AN 


S'-GAGAGCTAGCATGCCGGCTAGATCTCGAGATCGGCCGGCCTAGG 


SEQ ID 

NO:58 


FPAXPOLY 


S'-GAACTGCAGCTCGAGAGCTAGCATGCCGGC 


SEQ ID 
NO:59 


F96ANhe 


5*-GGAGATATACATATGGCTAGCATGACTGGTGG 


SEQ ID 
NO:60 


R96AHind 


S'-TGCTCGAAGCTTCGCAGAAGATAATAGTAGG 
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(8) DHR96 gene targeting 

296. The 7.55 kb genomic fragment containing a mutated DHR96 gene (see above) 
was inserted into the Drosophila genome as described (Rong YS, Golic KG. Gene targeting by 

5 homologous recombination in Drosophila. Science. 2000 Jun 16;288(5473):2013-8). w; [hsp70- 
FLP]4 [hsp70 1 See I]2b Sco/S2 CyO females were crossed to w; [<(96TG GFP+> w+] males 
that carried the targeting fragment on the second chromosome. Larvae were heat shocked during 
the third larval instar to trigger targeting events in the germline of females. [hsp70-FLP]4 [hsp70 
I See I]2b Sco/ [<(96TG GFP+> w+] females were then collected and crossed them to w; 
10 Serl/TM6B, Tb males. 918 vials of such crosses (5 males and 10 females) were set up which 
generated approximately 150,000 flies that were screened for GFP+, but white-eyed individuals. 
These flies were crossed to wl 1 1 8; Ly/TM6C Tb Sb, and stocks were subsequently established 
from a single chromosome. The DHR96E25 allele was isolated from one of these stocks. 

(9) Reduction of the DHR96 targeted event to a single copy by 
15 I-Crel 

297. Males carrying the tandem duplication allele (wl 1 1 8/Y; DHR96E25/DHR96E25) 
were mated to v hsp70 Crel; Sb/TM6 females in mass. After 3 days at 25°C, the parental flies 
were removed and the progeny were heat-treated at 36°C for one hour to induce Crel 
recombinase. Males that eclosed were individually mated to wl 1 18; Ly/TM6C females. One 

20 male progeny (wl 1 1 8/Y; DHR96Cre reduced/TM6C) that had lost GFP expression (indicating a 
recombination event had occurred) was selected from each vial and individually mated to 
wl 1 18; Ly/TM6C females to establish a stock containing the reduced allele (Rong and Golic 
2002). Mutant strains were characterized by Southern blotting, PCR, and DNA sequencing using 
standard methods. The DHR9616A mutant stock was selected for further characterization. 

25 (10) Tissue antibody stains 

298. Wandering third instar larval tissues were dissected and fixed as previously 
described (Boyd, L., O'Toole, E. and Thummel, C.S. (1991). Patterns of E74A RNA and protein 
expression at the onset of metamorphosis in Drosophila. Development 112, 981-995). DHR96 
protein was detected with anti-DHR96 antibodies diluted 1 : 100 and incubated overnight at 4 °C. 

30 Donkey anti-rabbit CY3 secondary antibodies (Jackson) were used at a 1 :200 dilution as a 

secondary antibody. The stains were visualized on a Biorad confocal laser scanning microscope. 
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(11) Western blots analysis 

299. Protein from adult flies was extracted by grinding flies in SDS sample buffer and 
boiling. The equivalent of approximately one adult fly was loaded in each lane of an 8% 
polyacrylamide gel, separated by electrophoresis and transferred to PVDF membrane. 

5 Ectopically expressed DHR96 protein was produced by heat-treating flies at 37.5 o C for 30 
minutes followed by a three hour recovery at room temperature before the extraction procedure. 
DHR96 protein was detected by incubating the membrane first with a 1 :500 dilution of anti- 
DHR96 affinity purified antibodies followed by a 1 : 1 000 dilution of goat anti-rabbit HRP 
secondary antibody (Pierce). A supersignal chemiluminescence kit was used to develop the 
10 signal (Pierce). 

(12) Toxicity assays 

300. Adult flies were raised on standard cornmeal/agar food and starved overnight 
under humid conditions at 25 0 C before treatment with DDT. A DDT stock solution was 
prepared by dissolving crystalline DDT (Sigma) in 100% ethanol. Appropriate DDT dilutions 

15 were made by diluting the DDT stock with 5% sucrose and pipetting 275 jil of the solution onto 
a strip of Whatman filter paper inside a small glass scintillation vial. Twenty adult flies were 
placed in each vial which was plugged with cotton. Mortality was scored 10 hours later at room 
temperature. For each DDT concentration, three replicates, each of twenty adult flies, were used. 
For the time course assay, 100 ng/^1 of DDT was used and mortality scored every hour for 10 

20 hours. 

b) Results 

(1) DHR96 is closely related to known xenobiotic receptors 

301. The phylogenetic relationship of DHR96 to other nuclear receptors was 
investigated for information related to function. When performing a BLASTP search, the closest 

25 homolog to DHR96 in vertebrates is the Vitamin D3 Receptor (VDR). The Pregnane X 

Receptor (PXR) as well as the Constitutively Androstane Receptor (CAR) comprise other high 
scoring homologs. (Fig. 1). 

(2) DHR96 is expressed in the alimentary canal, the salivary 
glands and the fat body 

30 302. Antibody stains of third instar larvae were used to analyze whether DHR96 would 

be expressed in tissues that function in detoxification. DHR96 antibodies strongly stain tissues 
of the alimentary canal (Fig. 2). In particular, the gastric caeca, the major site of absorption in 
Diptera, show a much stronger staining than the remainder of the midgut, which also plays a role 
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in nutrient absorption. Strong expression in the Malpighian tubules, the principal excretory 
organ in insects, was also observed. The excretory system maintains homeostasis, controlling 
salt levels and osmotic pressure, but is primarily responsible for the removal of harmful 
metabolites such as nitrogenous wastes derived from purine metabolism, or toxic compounds 
5 that were absorbed from the food. Outside the alimentary canal, strong staining in the salivary 
gland and the fat body were detected. The insect fat body is the functional equivalent of the 
mammalian liver, because it is the principal site of intermediary metabolism and detoxification. 
Taken together, the finding that DHR96 expression is tightly associated with tissues known to be 
involved in detoxification provides strong support for the proposal that DHR96 functions in a 
10 xenobiotic pathway. 

(3) DHR96 function is dispensable under standard conditions 

303. RNA interference (RNAi) and gene targeting were used to disrupt DHR96 
function because no existing mutants were available. The effects of DHR96 RNAi were 
analyzed by generating transgenic lines that express snapback RNA under the control of a heat- 

1 5 inducible promoter. Three independent lines showed strong reduction of DHR96 mRN A in 
northern blots when treated with a single heat-shock, but displayed no discemable phenotype. 
Using a variety of heat-shock regimens, e.g. longer single and double treatments or 12 hr 
repetitions, did not affect the outcome of this observation. These findings suggest that DHR96 
mRNA is not necessary for viability under standard conditions, indicating either that DHR96 

20 protein is very stable or dispensable for survival. 

304. Gene targeting (Rong, Y. S., and Golic, K. G. (2000). Science 288, 2013- 
201 8) was used to generate mutations in DHR96 because no deficiencies or P elements were 
known in this region of the genome. As a first step, the gene targeting procedure requires 
classical P-element transformation in order to generate transgenes that harbor the targeting 

25 sequence flanked by FRT sites. The targeting DNA is then mobilized and turned into a linear, 
recombinogenic molecule in vivo by activating the FLP recombinase and the endonuclease / See 
L As a consequence of this targeting technique, which is based on an "ends-in" mechanism, the 
resulting mutation is basically a replacement of the original gene with a tandem duplication of 
two mutant copies (Fig. 3). Mutations were engineered in such a way that both copies would 

30 result in non-functional gene products. In particular, a region around the translation start site 
(25 bp), and the complete sequence of exon four was deleted, the downstream intron, and the 
splice acceptor site at exon 5 (together -300 bp). These mutations should lead to a block in 
translation initiation as well as removal of most of the ligand binding domain of the receptor. 
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We constructed a targeting vector that contained two eye markers: pcix6-EGFP and mini-w///7e. 
Once mobilized by the FLP recombinase, the EGFP gene separates physically from the mini- 
white gene, which lies outside the FRT sites. Consequently, the subsequent strategy employed to 
identify potential targeting events is based on the presence of the EGFP marker and the 
5 simultaneous absence of the mini-w//i/e marker in the eye. 

305. In a screen of -1 50,000 flies, a total of 42 events were detected. Of these, 1 8 
mapped to the third chromosome, which harbors the DHR96 gene. At least one of the 1 8 events 
was identified as a targeting event in the DHR96 gene, and we termed this allele DHR96 E2S . To 
avoid problems that might arise from the truncated protein in the DHR96 E25 mutant, we decided 

10 to reduce the existing duplication to one mutant copy by utilizing the / Cre I site that was built 
into the targeting vector, essentially following the procedure described by (Rong, Y. et al., 
(2002) Genes Dev 16, 1568-1581). This procedure yielded a new DHR96 allele, DHR96 ,6A 7 
which, based on sequence and western analysis, constitutes a protein null. Several lines of 
evidence suggest that these alleles represent specific targeting events in the DHR96 gene. First, 

1 5 genomic Southern blots of animals homozygous for the targeting events displayed the predicted 
fragment patterns of a tandem duplication (DHR96 E25 ) or a reduced single copy (DHR96 ,6A ), 
Second, northern analysis revealed the absence of the wild type mRNA in the mutant animals. 
Third, antibody stains and Western analysis show a strong reduction or absence of the DHR96 
protein in DHR96 I6A ot DHR96 E25 flies (add fig for this). Fourth, Southern blot hybridization and 

20 sequencing of PCR products demonstrated that exon/intron 4 of wild type DHR96 is absent in 
homozygous DHR96 l6A or DHR96 E25 animals. 

306. Flies homozygous for DHR96 E2S or DHR96 ,6A are viable and fertile when grown 
on standard commeal food. However, when placed on instant food (Carolina 424) in the absence 
of yeast, viability decreases to about 1%, whereas wild type flies do comparably well with a 

25 survival rate of -35% compared to standard food. Interestingly, the addition of yeast restores 
viability to 100%. This suggests that either DHR96 is required for the proper execution of 
certain nutritional pathways, or that DHR96 E25 larvae fail to neutralize toxic metabolites that are 
produced when animals are reared on nutritionally poor media. To test the possibility that 
DHR96 mutants have a decreased tolerance for toxins, it was determined whether DHR96 is 

30 expressed in tissues that are known to play critical roles in the detoxification process. 
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(4) DHR96 mutants display reduced viability in the presence 
of DDT 

307. As a test of DHR96 acting in a xenobiotic pathway, DHR96 mutants were tested 
for sensitivity to the pesticide DDT. Adult wild type flies (Canton S) and DHR96 I6A were 

5 exposed or DHR96 E25 flies to varying concentrations of DDT and recorded survival rates after a 
fixed time. The findings showed that DHR96 mutants were more sensitive to DDT and died at 
lower concentrations of DDT compared to control animals (Fig. 4 A). In addition, when 
challenged with a fixed concentration of DDT, DHR96 homozygotes died more rapidly than wild 
type flies (Fig. 4B). Taken together, these results indicated that DHR96 is required for natural 
10 resistance levels to the pesticide DDT, and that DHR96 functions in a xenobiotic response 
pathway. 

(5) Overexpression of DHR96 has no effect on viability 

308. Most nuclear receptors cause lethality when overexpressed, indicating that these 
proteins do not require an obligatory ligand for some or even all of their functions. To analyze 

1 5 whether DHR96 would disrupt essential pathways and cause lethality when expressed 

ectopically, a transgenic line that harbored a full-length DHR96 cDNA under the control of a 
heat- inducible promoter was produced. Western and Northern analysis showed that heat-treated 
larvae and flies carrying this construct generated at least 100 times more DHR96 mRNA and 
protein than wild type flies lacking the transgene. Nevertheless, overexpression of this protein 

20 did not result in any visible effect, suggesting two possible scenarios: (I) DHR96 activity 

requires binding to a ligand or a protein partner, or (II) DHR96 target genes do not function in 
vital pathways, at least not under standard laboratory conditions. Naturally, both possibilities 
may be true. Microarray experiments were used to dissect how DHR96 might function on the 
molecular level. 

25 c) Microarray experiments 

309. As a first step toward identifying target genes regulated by DHR96, the protein 
was overexpressed in larvae and analyzed its effects on gene expression by microarray analyzed. 
Afirymetrix oligonucleotide chips designed to detect -13,200 genes (the majority in the fly 
genome) were used, the raw data with dCHIP (Li C, Wong WH. Model-based analysis of 

30 oligonucleotide arrays: expression index computation and outlier detection. Proc Natl Acad Sci 
USA. 2001 Jan 2;98(l):31-6; Li, C, and Wong, W. H. (2001) Genome Biol 2, 0032.1- 
0032. 1 1 ; http://www.dchip.org/) was analyzed, and filtering with Microsoft Access was 
performed. After rigorous filtering, only 71 genes remained that had a higher than 1 .8-fold 
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change when compared to the controls. Interestingly, of the top 20 reduced genes, six are 
members of all four major detoxification gene families (Table 3), which comprise a total of 198 
members in Drosophila, This represents a highly significant result (p=2.8xl0 27 , based on x*), 
because the chances of picking 6 of these genes in a random sample of 20 genes are more than 
5 20-fold lower than the observed number. Interestingly, no such concentration of genes encoding 
detoxifying enzymes exists on the list of induced genes, suggesting that DHR96 may repress 
these genes in the absence of suitable ligands. 
2. Example 2 

a) GAL4-DHR96/LBD experiments 

10 3 1 0. To determine if DHR96 is activated by the pesticide DDT the methods disclosed 

herein can be used. Flies containing two different transgenes will be mated together allowing us 
to directly assay for DHR96 LBD activation in vivo (for detailed methods and description of 
vectors see: (Kozlova, T., and C.S. Thummel (2003) Methods to characterize Drosophila 
nuclear receptor activation and function in vivo. In: "Methods in Enzymology. Nuclear 

15 Receptors, Vol. 364 (Russell, D.W., and Mangelsdorf, D.J., eds.), Academic Press, New York, 
pp. 475-490.)). One transgene is under the control of a heat-inducible promoter and contains the 
GAL4 DNA binding domain fused to the DHR96 ligand binding domain . The second transgene 
contains a GAL4-dependent GFP or lacZ reporter gene (Kozlova, T., and C.S. Thummel (2003) 
Methods to characterize Drosophila nuclear receptor activation and function in vivo. In: 

20 "Methods in Enzymology. Nuclear Receptors, Vol. 364 (Russell, D. W., and Mangelsdorf, DJ., 
eds.), Academic Press, New York, pp. 475-490.)). Upon heat induction, GAL4-DHR96 LBD 
protein can bind to the UAS-GFP or UAS-lacZ reporter. In the absence of a ligand, the reporter 
will not be activated; however, in the presence of a ligand, the GAL4 DHR96 LBD protein can 
be switched into an active conformation and induce reporter gene expression (Kozlova, T., and 

25 C.S. Thummel (2003) Methods to characterize Drosophila nuclear receptor activation and 

function in vivo. In: "Methods in Enzymology. Nuclear Receptors, Vol. 364 (Russell, D.W., 
and Mangelsdorf, D. J., eds.), Academic Press, New York, pp. 475-490.); Kozlova, T. and 
Thummel, C.S. (2002). Spatial patterns of ecdysteroid receptor activation during the onset of 
Drosophila metamorphosis. Development 129, 1739-1750). 

30 31 1 . To determine if drugs, such as DDT, can activate the DHR96 GAL4-LBD 

construct, two developmental stages will be tested. First, organs from late third instar larvae that 
have both transgenes will be dissected and cultured in the presence of several different 
concentrations of drug and assayed for reporter gene expression. Second, if activation of the 
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GAL4-LBD construct by drug requires either ingestion of the toxin or contact with the cuticle of 
the fly, adults will be heat-shocked to induce the GAL4-LBD construct, placed in scintillation 
vials containing drug, as previously above in the toxicity assays, and assayed for induction of 
reporter gene expression in adult tissues. Changes in the activity of the reporter gene in the 
5 presence, but not the absence, of drug will be an indication that that compound is having a direct 
effect on the activity state of the DHR96 LBD. 

312. Disclosed are systems that can identify ligands, such as hormones, for nuclear 
receptors, such as drosophila nuclear receptors. There are many members of the nuclear receptor 
superfamily for which there is no known ligand - the so called orphan nuclear receptors. It is 

10 desirable to link these receptors to a ligand if it exists. 

313. One way of identifying ligands for nuclear receptors involves expressing a fusion 
of the GAL4 DNA binding domain to a nuclear receptor ligand binding domain (LBD), in 
combination with a GAL4-reponsive reporter gene. The fusion protein is inactive unless its 
hormone is present, allowing it to switch into an active conformation and turn on the GAL4- 

15 responisve reporter, such as a lacZ report giving a color readout. In one variation of this method, 
which has been widely exploited by pharma companies for high throughput screens, stably 
transfected tissue culture cells of different cell types are used for the cell background to perform 
the assay. One way to do this assay would be use every tissue in the animal as a context for 
screening for hormones, not just a tissue culture cell where the appropriate cofactors or partner 

20 transcription factors might be missing, because presumably every cell has a different molecular 
background. 

314. One method used to get around this problem in mice is disclosed in WO 00/17334 
for "Analysis of ligand activated nuclear receptors (in vivo)" by Solomon et al. (See also, 
Solomin, L., et al., (1998). Nature 395, 398-402). This system was designed for the mouse, 

25 because the GAL4 system of linking the GAL4 DBD to a particular LBD works poorly in mouse. 

3 1 5. Disclosed herein is a system for drosophila for identifying ligands for nuclear 
receptors, where the GAL4 system works very well for driving tissue- and stage-specific ectopic 
gene expression. The system typically utilizes a heat-inducible promoter to widely express the 
GAL4-LBD fusion proteins, but any inducible promoter can be used. This allows monitoring of 

30 activation in all tissues both spatially and temporally. The pattern of lacZ expression in animals 
so transformed allows visualization of where and when a particular LBD is active during 
development, guiding one towards possible sources of hormone. 
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3 1 6. This has been used to show the patterns of GAL4-EcR and G AL4-USP activation 
during the onset of metamorphosis accurately reflect what would be expected for regulation of 
EcR/USP by its hormone, 20-hydroxyecdysone (Kozlova, T. and Thummel, C.S. (2002). Spatial 
patterns of ecdysteroid receptor activation during the onset of Drosophila metamorphosis. 
5 Development 129, 1739-1750). Spatial patterns of ecdysteroid receptor activation during the 
onset of Drosophila metamorphosis. Development 129, 1739-1750). This system has also been 
used to show that an orphan nuclear receptor, DHR38, is activated by a unique set of 
ecdysteroids in the animal (Baker, K. D., et al., (2003). The Drosophila orphan nuclear receptor 
DHR38 mediates an atypical ecdysteroid signaling pathway. Cell 1 13, 73 1-742). 

10 317. Disclosed herein are hsp70-G AL4-LBD transformants for all 1 8 Drosophila 

nuclear receptors. The activation patterns of these constructs have been characterized during 
embryogenesis and the onset of metamorphosis. These constructs can be used with a UAS-GFP 
reporter to simplify the readout of activation, paving the way for compound screens. 

3 1 8. These constructs can be used to screen compounds for ligand activity. For 

1 5 example, a collection of pesticides can be found in the Agro plate (see 

http://www.msdiscovery.com). Other plates can also be found at Micro Source Discovery, and 
are herein incorporated by reference at least for compound libraries and their contents. They also 
list plates of available collections of natural compounds. 

20 319. Disclosed are methods for screening for the presence of xenobiotic receptor 

ligands using the constructs and methods disclosed herein, such as those for the GAL4-DHR96 
fusions. 
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H. Sequences 

1. SEQ ID NO: 1 Accession No. NM_130611 Drosophila melanogaster 
CG16902-PA 

25 

MTLSRGPYSELDKMSLFQDLKLICRRKrDSRCSSDGESIADTSTS 
SPDLLAPMSPKLCDSGSAGASLGASLPLPLALPLPMALPLPMSLPLPLTAASSAVTVS 
LAAVVAAVAETGGAGAGGAGTAVTASGAGPCVSTSSTTAAAATSSTSSLSSSSSSSSS 
TSSSTSSASPTAGASSTATCPASSSSSSGNGSGGKSGSIKQEHTEIHSSSSAISAAAA 
30 STVMSPPPAEATRSSPATPEGGGPAGDGSGATGGGNTSGGSTAGVAINEHQNNGNGSG 
GSSRASPDSLEEKJ>STTTTTGRPTLTPTO^ 

SVKEERLLNVSSKMLVFHQQREQETKAVAAAAAAAAAGHVTVLVTPSR1KSEPPPPAS 

PSSTSSTQRERERERDRERDRERERERDRDREREREQS1SSSQQHLSRVSASPPTQLS 

HGSLGPN1VQTHHLHQQLTQPLTLRKSSPPTEHLLSQSMQHLTQQQAIHLHHLLGQQQ 

35 QQQQASHPQQQQQQQHSPHSLVRVKJCEPNVGQRHLSPHHQQQSPLLQHHQQQQQQQQQ 
QQQHLHQQQQQQQHHQQQPQALALMHPASLALRNSNRDAAILFRVKSEVHQQVAAGLP 
HLMQSAGGAAAAAAAAVAAQRMVCFSNARINGVKPEVIGGPLGNLRPVGVGGGNGSGS 
VQCPSPHPSSSSSSSQLSPQTPSQTPPRGTPTVIMGESCGVRTMVWGYEPPPPSAGQS 
HGQHPQQQQQSPHHQPQQQQQQQQQQSQQQQQQQQQQSLGQQQHCLSSPSAGSLTPSS 

40 SSGGGSVSGGGVGGPLTPSSVAPQNNEEAAQLLLSLGQTRIQDMRSRPHPFRTPHALN 
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MERLWAGDYSQLPPGQLQALNLSAQQQQWGSSNSTGLGGVGGGMGGRNLEAPHEPTDE 
DEQPLVCMICEDKATGLHYG1ITCEGCKGFFKRTVQNRRVYTCVADGTCEITKAQRNR 
CQYCRFKKCIEQGMVLQAVREDRMPGGRNSGAVYNLYKVKYKICHKKTNQKQQQQAAQQ 
QQQQAAAQQQHQQQQQHQQHQQHQQQQLHSPLHHHHHQGHQSHHAQQQHHPQLSPHHL 

5 LSPQQQQLAAAVAAAAQHQQQQQQQQQQQQQAICLMGGVVDMKPMFLGPALKPELLQAP 
PMHSPAQQQQQQQQQQQQQQASPHLSLSSPHQQQQQQQGQHQNHHQQQGGGGGGAGGG 
AQLPPHLVNGTILKTALTNPSEIVHLRHRLDSAVSSSKDRQ1SYEHALGMIQTLIDCD 
AMED1ATLPHFSEFLED1CSE1SEKLCNIGDSIVHKLVSWTICKLPFYLEIPVEIHTICLL 
TDKWHEILILTTAAYQALHGK.RRGEGGGSRHGSPASTPLSTPTGTPLSTPIPSPAQPL 

10 HKDDPEFVSEWSHLSTLQTCLTTLMGQPIAMEQL 

KMEEYVCLKVYILLNKGTWFDLQNPF1QCSCYLLVRFVNPAEVELESIQERYVQVLRS 

YLQNSSPQNPQARLSELLSHCPEIQAAASLLLESKMFYVPFVLNSASIR 
ORIGIN 

15 2. SEQ ID NO: 2 Accession No. NM_130611 Drosophila melanogaster 



CG16902-PA 



25 



20 



40 



45 



35 



30 



50 



55 



1 atgacactga gccgtggccc gtacagegag ctcgataaaa tgagcctttt tcaagacctc 
61 aaactcaaac ggcgcaaaat egattcgega tgcagcagtg aeggegagtc catageggae 
121 acgtccacct cgtcgccgga cctgctggcg cccatgtcgc cgaagctctg cgacagcggc 
1 8 1 teggeggggg cgtcgctggg ggcatcgctg cccctgccgc tggccctgcc cctgccaatg 
241 gccctgccac tgcccatgtc gctgcccctg cccctcacgg cggcatcttc ggcggtcacc 
301 gtttcgctgg cageggtegt ggccgcggtg gecgagaegg gtggcgcggg cgcgggagga 
361 gctgggacag cagtaacagc gtegggagea ggaccatgcg tctccacgtc gtctacgacg 
421 geageggcag ccacatcctc gacctcctcg ctctcgtcct cctcctcttc gtcatcctcc 
481 acgtcctcca gcacttcctc cgcctcgccg acagctggag cctcctccac ggccacctgc 
541 cccgccagca gcagcagcag cagtggaaac ggaagtgggg gcaaaagtgg tagcatcaag 
601 caggagcaca eggagataca ctcgtcgagc agtgcgattt cggcggccgc cgcctcaacg 
661 gtgatgtcac cgccgcccgc tgaggcgacg agatccagtc cagccacgcc cgagggaggc 
721 ggaccagctg gcgacggaag tggagcaacg ggaggcggaa acacgagegg eggatcaacg 
781 gctggagtgg ccattaatga acaccaaaac aatggcaatg geageggegg gagcagtcga 
84 1 gcctctcccg attcgctgga agagaagece tctaccacaa cgaccacagg tcgtccaacg 
901 ctcacgccca cgaatggggt gctgtcctcc gcctcggcgg geaeggggat ttccacagga 
961 agcagcgcca agetgagega ggctggtatg agtgtgatac ggtccgtgaa ggaggagege 
1021 ttgctcaacg tatccagcaa gatgctggtg ttccatcagc agegggagea agagaccaaa 
1081 gcagtggcgg ctgcagcagc ageagcageg gcgggccatg tgacggttct agtgacgcca 
1141 tcgcgcatca aateggagee accgccgccg gcttcaccct cctctacatc cagcacacaa 
1201 agggaaaggg aacgggaacg cgatcgagag agggatcgeg aaagggaacg egagegggae 
1 26 1 egggaceggg aacgggaacg ggaacagtcc atcagctcct cgcagcagca ectaagtegg 
1 32 1 gtctccgcca gtccacccac tcagctgtcc cacggcagcc tgggacccaa cattgtgcag 
1381 acgcaccatc ttcaccagca actcacacag ccgctgacgc tgegcaagag cagcccgccc 
1 44 1 acagagcacc tgetcagtea gtccatgcaa catctcacac agcagcaggc gatccacctg 
1 501 catcacctac ttggccagca gcagcagcag cagcaggegt cgcatcccca gcagcaacag 
1561 cagcagcaac actcgcccca ctccctggtg cgggtgaaaa aggaaccgaa tgttggtcag 
1 62 1 eggcacttat cgccgcatco ccaacaacag tcgccactcc tgcagcacca ccaacagcag 
1 68 1 cagcagcagc aacaacaaca gcaacagcat ctgcatcagc aacagcaaca gcagcagcat 
1 74 1 caccagcagc agccccaggc actggccctg atgeatcegg cttccctggc gctaaggaac 
1 801 ageaateggg atgeggecat tetgmegg gtgaagagcg aagtgcacca gcaggtggcc 
1 861 gccgggctgc cgcatctgat gcagtccgct ggtggggcag cggccgccgc cgcagcagct 
1921 gtggccgctc agcgaatggt atgettcage aatgccagga tcaatggcgt taagceggag 
1 98 1 gtgattggag gaccgctggg caacctgcgg cccgtgggcg tcggtggcgg aaacggaagt 
2041 ggctccgtgc agtgcccctc gccgcatcca tcctcctcgt cgtcatcctc gcagctgtcg 
2101 ccgcagacgc cctcccagac gccgccccga ggcacgccca ccgtcataat gggegagage 
2161 tgcggggtgc gcaccatggt ctggggctac gagcctccgc caccctcggc gggccagtcc 
222 1 cacggccagc acccgcaaca gcaacagcag tcgccccacc accagccgca acaacaacag 
228 1 cagcagcaac aacagcagtc gcagcagcaa cagcaacagc agcagcaaca gtcgctgggc 
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234 1 cagcagcagc actgcctctc ctcgccgtcg gegggatege tgacgccctc cicttcgtcc 
2401 ggcggtggtt eggtatctgg eggeggagtg ggcggaccac tcacaccctc ctcggtggcg 
246 1 cegcagaata acgaggaggc cgcccaactc ctgctctccc tgggacagac acgcatccag 
252 1 gacatgagat cacggccaca ccccttccgc acaccgcacg cccttaatat ggagcggctg 
5 2581 tgggegggag actactcgca attgccgccc ggccagctgc aggctctgaa tctcagtgcc 

264 1 caacagcagc agtggggcag cagcaactcc aegggtcttg gtggcgtagg eggeggcatg 
270 1 ggeggacgea acctggaggc gccgcacgag ccgaccgacg aggacgaaca gccgctcgtt 
276 1 tgeatgatet gcgaggacaa ggccaccggc ctgcactacg gcatcatcac ctgcgagggg 
282 1 tgeaaggget tettcaageg gacggtgcag aaccgacgag tctacacctg cgtggcggac 
10 2881 ggcacctgcg agataaccaa agcacagcgc aaccgttgtc agtattgtcg atttaagaag 
294 1 tgeatcgage agggcatggt gctgcaagcc gttcgegagg atcgcatgcc gggeggtege 
300 1 aacagtggcg ccgtctacaa tttgtacaag gtgaagtaca agaagcacaa gaagaccaat 
3061 cagaagcagc agcagcaggc cgcccagcag cagcagcagc aggeggegge gcagcagcag 
3121 caccagcaac agcagcagca tcaacagcac cagcaacatc agcaacagca gttgeacteg 
15 3181 ccgctccacc atcaccacca ccagggccac cagtcgcacc acgcgcagca gcagcaccac 
324 1 ccacagctgt cgccgcacca cctgctgtcg ccgcagcagc ageaacttge cgccgcggtg 
3301 gcagcagctg cgcagcacca acagcaacag caacaacagc agcaacagca gcagcaggcc 
336 1 aagctgatgg gcggcgtggt ggacatgaag cccatgttcc tcggccccgc tttgaagccg 
342 1 gagttgctgc aagcaccccc catgeacagt ccggcccagc aacaacaaca gcagcagcag 
20 3481 cagcagcagc aacagcaggc ctcgccgcat ctctcgctta gctcaccgca ccagcagcag 

354 1 cagcagcagc agggacagca ccaaaaccac caccagcaac aaggtggggg tggcggagga 
360 1 gctggtggag gagctcaact gccgccgcac ctggtgaacg gaacgatact gaagaeggee 
366 1 ctaaccaatc ecagegagat tgtacatctg cgccaccgcc tcgactcggc ggtcagttcg 
372 1 tccaaggacc gacagatctc gtacgagcac gecttaggea tgatccagac actgatcgac 
25 378 1 tgegacgega tggaggacat agccacactg ccgcacttca gcgagttcct tgaggacaag 

384 1 teggagatta gcgagaaact gtgeaacate ggegattcca tagtccacaa gctggtgtcg 
390 1 tggacaaaaa agttgccctt ctacctggag atcccggtgg agatacatac caaactactg 
396 1 aeggacaagt ggcacgagat ccttatcctg accacggccg cctaccaggc gttgcatggc 
402 1 aagcggcgtg gcgagggagg aggcagcagg catggttcgc cggcgtcaac gccactgagc 
30 408 1 acgcccactg gtacgccgtt gagcacaccg ataccctcgc ccgcccagcc actgeacaag 

4141 gacgacccgg agtttgtcag cgaggtgaac tcgcacctga gcacactgca aacctgettg 
4201 accacgctaa tgggccagcc gatagegatg gagcagctga agctggacgt egggcacatg 
4261 gtggacaaga tgacccagat caccatcatg ttccggcgaa tcaagctcaa gatggaggag 
432 1 tacgtctgcc tgaaggttta catactgeta aacaaaggta cgtggttcga tttgeaaaac 
35 438 1 ccattcatac agtgctcatg ttaccttctc gttcgttttg taaatccagc agaagtggaa 

444 1 ctggagagca tecaggageg gtacgtccag gtgetgeget cctacctgca aaactcctcg 
4501 ccgcagaatc egcaggegag gctcagtgaa ctgctctccc acataccaga gatccaggct 
4561 gcggctagcc tgetgetega gagcaagatg ttctatgtgc ccttcgtgct caactcggcg 
462 1 agcataaggt ag 



40 



3. SEQ ID NO: 3 Accession No. NM_168775 Drosophila melanogaster ftz 
transcription factor 1 CG4059-PA 



MLLEMDQQQATVQFISSLN1SPFSMQLEQQQQPSSPALAAGGNS 

45 SbWAASGSN^SASGNNTSSSSNNNNNN^ 

SGNQQHHSNHSNHGNHHQQQQQQQQQQQQHQQQQQEHYQQQQQQN1ANNANQFNSSSY 
SYIYNFDSQYIFPTGYQDTTSSHSQQSGGGGGGGGGNLLNGSSGGSSAGGGYMLLPQA 
ASSSGNNGNPNAGHMSSGSVGNGSGGAGNGGAGGNSGPGNPMGGTSATPGHGGEVIDF 
KHLFEELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNKXVYTCVAERSCHIDKTQRK 

50 RCPYCRFQKCLEVGMKLEAVRADRMRGGRNKFGPMYKKDRARKLQVMRQRQLALQALR 
NSMGPDIKPTP1SPGYQQAYPNMNIKQEIQIPQVSSLTQSPDSSPSPIAIALGQVNAS 
TGGV1ATPMNAGTGGSGGGGLNGPSSVGNGNSSNGSSNGNNNSSTGNGTSGGGGGNNA 
GGGGGGTNSNDGLHRNGGNGNSSCHEAGIGSLQNTADSKLCFDSGTHPSSTADALIEP 
LRVSPM1REFVQSIDDREWQTQLFALLQKQTYNQVEVDLFELMCKVLDQNLFSQVDWA 

55 RNTVFFKDLKVDDQMKLLQHSWSDMLVLDHLHHRJHNGLPDETQLWGQVFNLMSLGL 
LGVPQLGDYF^^ELQNK1^QDL1CFDMGDYVCMKPLILLNPSVRGIVNRXTVSEGHD^^VQA 
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ALLDYTLTCYPSVNDKFRGLVNILPEIHAMAVRGEDHLYTKHCAGSAPTQTLLMEMLH 
AKRKG 

5 

4. SEQ ID NO: 4 Accession No. NM_1 68775 Drosophila melanogaster ftz 
transcription factor 1 CG4059-PA 

1 ctacgcaaaa taaaacgtac atgaaatgtt attagaaatg gatcagcaac aggcgaccgt 
10 61 acagtttata tcgtcgctga atatatcgcc gttcagcatg cagctggagc agcagcagca 

121 gccctccagt cccgctctgg ccgccggtgg caacagcagc aacaacgcgg ccagcggtag 

1 8 1 caacaacaac agcgccagcg gcaacaacac cagcagcagc agcaacaaca acaacaacaa 

241 taacaacgac aatgatgcac acgttctaac gaaattcgag cacgaataca atgcctacac 

301 gttgcagttg gccggaggcg gtgggagtgg cagcggcaat cagcagcacc acagcaacca 
15 361 cagcaaccac ggcaaccacc accagcagca gcagcaacaa cagcaacagc agcagcaaca 

42 1 tcagcagcag cagcaagaac actaccagca gcaacagcaa cagaatatcg ccaacaatgc 

481 caatcaattc aactcctcgt cctactcgta tatatacaat ttcgattcac agtatatatt 

541 cccgacaggc taccaggaca ccacctcctc acactcgcaa cagagcggag gaggcggtgg 

60 1 cggcggcggt ggcaacctgc taaacggcag ctccggcggc agctccgccg gcggtggcta 
20 66 1 catgctgctc ccccaggcgg ccagctccag tggcaataat ggcaatccga atgccggcca 

721 catgtcctcc ggttccgtgg gcaatggcag cggaggcgct ggcaatggcg gagcgggcgg 

781 caactccggt cccggcaatc ccatgggcgg tacgagcgcc acgccgggac acggcggcga 

841 ggtgatcgac ttcaagcacc tgttcgagga gctttgcccc gtgtgtggcg acaaggtgag 

901 cggctaccac tacggcctgc tcacctgcga gtcctgcaag ggattcttca agcgcaccgt 
25 961 gcagaacaag aaggtctaca cctgcgtggc ggagcggtcg tgccacatcg acaagacgca 

1021 gcgcaagcgg tgtccctact gccgattcca gaagtgcctc gaggtgggca tgaagctaga 

1081 ggctgttcga gcggatagaa tgcgtggtgg acgcaacaaa ttcggaccca tgtacaaacg 

1141 ggatcgcgcg cggaagttgc aagtgatgcg gcagcggcag ttggcgctgc aagcgctgcg 

1201 caactcgatg ggtccggaca tcaagccaac gccgatctcg ccgggctacc agcaagcata 
30 1 26 1 tccaaatatg aacattaagc aggaaattca aatacctcag gtatcctcac tcacccaatc 

1321 tccggactcg tcgcccagcc ccatagcaat tgcgttggga caggtgaacg cgagcacggg 

1 38 1 cggtgttata gccacgccca tgaacgccgg cactggcggc agtgggggcg gtggtctgaa 

1441 cggaccaagt tccgtgggca acggcaatag cagcaacggc agcagcaacg gcaacaacaa 

1501 cagcagcacg ggcaacggaa cgtccggagg aggaggtggc aataatgcgg gcggcggagg 
35 1561 aggaggaacc aattccaacg atggcctgca tcgcaacggc ggcaatggca acagcagttg 

162 1 ccacgaggct ggaataggat ctctgcagaa cacggccgac tcgaaattgt gcttcgattc 

1681 tggcacacat ccatcgagca cagccgacgc gctaatcgag ccattaagag tctcaccgat 

1 74 1 gattcgtgaa tttgtgcaat ctattgacga tcgggaatgg cagacgcaac tgtttgccct 

1801 gctgcagaag caaacctaca accaggtgga agtggatctc ttcgagctga tgtgcaaagt 
40 1 86 1 gctcgaccag aatttgttct cgcaagtaga ctgggcacgg aacaccgtct tcttcaagga 

1 92 1 tctgaaggtc gacgaccaaa tgaagctgct gcagcattcc tggtcggaca tgcttgttct 

1981 ggatcacctg catcatcgaa tccataacgg cctgcccgac gagacgcaac tgaacaatgg 

2041 tcaggtgttc aatctgatga gtctgggttt gttgggagtg ccacagctgg gcgattactt 

2101 caacgagctg cagaacaagc tgcaggacct gaaattcgat atgggcgact atgtctgcat 
45 2 1 6 J gaaattccta atcctgttga atccaagtgt acggggtatt gtcaaccgga agaccgtctc 

2221 cgagggacat gataatgtgc aagccgcttt gctggactac accctcacct gctatccgtc 

228 1 agtgaatgac aaattcagag ggctagttaa catcttaccg gaaatccatg ccatggccgt 

2341 tcgcggcgag gatcacctgt acaccaagca ctgtgccggc agtgcgccca cccaaacgct 

240 1 gctcatggag atgctgcacg ccaagcgcaa gggatagagg ccgggagaac gtgacacgga 
50 246 1 atacttaatc atttatgaaa tgtaaataac aaggcgggaa ggccctcggg gcaaccgggt 

2521 catggaaggc gaacgaagga tacagcagaa ttccgtatta tgaatatggg aatgcatcat 

2581 cactactacc accaactatc acacctatac acacacatgc acacatttgt tgattcaatg 

2641 ttaattatta ttacgtttac ggttaggtct agtttacgtt taactaatta attaatngt 

2701 cttaaattaa ttcgtgtttt atttgiagtc ccigataaag caattttaaa acacttgaac 
55 2761 ctaaacgaga atatgtagta gatgtatgga tttaaattta aatacggcaa ggagaaacac 

2821 acttttttag gcattacaaa acaaaagaag catgagaaat tttattttta tatacctata 
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288 1 tgaatacgai acttatggat acaaatctat atataltttl atgtaaattg gcgtacttu 
294 1 agcgtcctac atatttttta attagaattt ggttatacla tagtutgaa attagtatcg 
3001 ttcccactlg aagatcgatt cttgtatttt tugegecaa gtgtcttgca tagtatttgc 
3061 gtctaatcta atggcaacaa aaaaaatatt ggaaaalcca tacaaagaaa atgaaaacaa 
5 3121 agcaaattta ggtgttcatg gtatgaatgt atgtgtatat tataattgta atttca tcta 

3181 agtgtaagaa aacaatgeaa acaactacct acaacaagat aatgaagagc aagaaattat 
324 1 ataaattaat aaaggtcgtg ttaaaaact 

5. SEQ ID NO: 5 Accession No. NM176123 Drosophila melanogaster 
10 Hormone receptor-like in 46 CG33183-PA 

MYTQRMFDMWSSVTSKLEAHANNLGQSNVQSPAGQNNSSGSIKA 
Q1E11PCKVCGDKSSGVHYGVITCEGCKGFFRRSQSSVVNYQCPRNKQCVVDRVNRNR 
CQYCRLQKCLKiGMSRDAVKFGRMSKXQREKVEDEVRFHRAQMRAQSDAAPDSSVYDT 
15 QTPSSSDQLHHNNYNSYSGGYS^r^^EVGYGSPYGYSASVTPQQTMQYDISADYVDSTTY 
EPRSTIIDPEF1SHADGDINDVL1KTLAEAHANTNTKLEAVHDMFRKQPDVSRILYYIC 
NLGQEELWLDCAEKLTQMlQNIIEFAlCLIPGFMRLSQDDQILLLKTGSFELArVRMSR 
LLDLSQNAVLYGDVMLPQEAFYTSDSEEMRLVSRJFQTAKS1AELKLTETELALYQSL 

20 VLLWPERNGVRGNTEIQRLI^SNmAIRQELETNHAPLKGDVTVLDTLLNNlPNFRDl 

SILHMESLSKFKLQHPNWFPALYKELFS1DSQQDLT 

6. SEQ ID NO: 6 Accession No. NM_1 76123 Drosophila melanogaster 
25 Hormone receptor-like in 46 CG33183-PA 

1 gaattcattc aactgeaaag ageagecaaa ttgegcatae gecgegtatg geegteggtg 
61 tgagtgcccg tgttcatcag eggttgeate aactgatacc aagtgtacat aactacagct 

1 2 1 acaattgeaa ctatttcacc aatcaaegge ageggcaaca acatcagcaa cagcaccggc 
30 181 aaacgtttga aacgtcacca aagcttcgea tttcccacta ataattatgt ataegcaacg 

241 tatgtttgac atgtggagca gcgtcacttc gaaactggaa gcacacgcaa acaatctegg 

30 1 tcaaagcaac gtccaatcgc eggegggaca aaacaactcc ageggttcca ttaaagctca 

361 aattgagata attccatgea aagtctgegg cgacaagtca tccggcgtgc attaeggagt 

42 1 gatcacctgc gagggctgea agggattctt tcgaagatcg cagagctccg tggtcaacta 
35 48 1 ccagtgtccg cgcaacaagc aatgtgtggt ggaccgtgtt aatcgcaacc gatgtcaata 

54 1 ttgtagactg caaaagtgee taaaactggg aatgagccgt gatgctgtaa agttcggcag 

601 gatgtccaag aagcagcgcg agaaggtcga ggacgaggta cgcnccatc gggeccagat 

661 gcgggcacaa agegacgegg caceggatag ctccgtatac gacacacaga cgccctcgag 

721 cagcgaccag ctgcatcaca acaattacaa cagctacagc ggeggctact ccaacaacga 
40 78 1 ggtgggctac ggcagtccct aeggataetc ggcctccgtg acgccacagc agaccatgea 

841 gtacgacatc teggeggact acgtggacag caccacctac gagccgcgca gtacaataat 

90 1 cgatcccgaa tttattagtc aegeggatgg cgatatcaac gatgtgctga teaagacget 

961 ggcggaggcg catgccaaca caaataccaa actggaagct gtgeacgaca tgttccgaaa 

1021 geagceggat gtgtcgcgca ttctctacta caagaatctg ggecaagagg aactctggct 
45 108 1 ggactgcgcc gagaagctta cacaaatgat acagaacata atcgaatttg ctaagctcat 

1141 acegggatte atgcgcctaa gtcaggacga tcagatatta ctgetgaaga cgggctcctt 

1201 tgagctggcg attgttcgca tgtccagact gcttgatctc tcacagaacg eggttctcta 

1261 eggegaegtg atgctgcccc aggaggegtt ctacacatcc gaeteggaag agatgegtet 

1 32 1 ggtgtcgcgc atcttccaaa cggccaagtc gatagecgaa ctcaaactga ctgaaaccga 
50 1381 actggcgctg tatcagagct tagtgetget ctggccagaa cgcaatggag tgcgtggtaa 

1441 tacggaaata cagaggcttt tcaatctgag catgaatgeg ateeggcagg agctggaaac 

1501 gaatcatgcg ccgctcaagg gcgatgtcac cgtgctggac acactgetga acaatatacc 

1561 caatttccgc gatatttcca tettgeacat ggaatcgctg agcaagttca agetgeagea 

1621 cccgaatgtc gtttttcegg cgctgtacaa ggagctgttc tcgatagatt cgcagcagga 
55 1681 cctgacataa caagagcagc agccgttcct ggagacgacc gcggacgatg ttgccgagga 
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174! tgcggctgcc gccggatgtg icctgccgcc ggtggcgccc ccigccgggc agcaaccagc 

1801 getgetcgag gactgagggc cgcaggatgt ggcaacaata attatngag laaacactgc 

1S61 actgegcatg cagcagatac aagaactua tcatganta agctagcata caaccaagga 

1921 tgtgatcctc gecaaggact cacttaaaaa gaactclatc tatatacata tatatattat 
5 1 98 1 atatgacaga gcggatgacg caaagggaag ggaaaatatt tcaaaaatat tgttaactca 

2041 gttaagactt ttgcttcgta gagaaccgaa accgaaaccg attgeattte gagcaagggg 

2101 catcaaactg attttcgagg ttatactata catatataca cacaaacaca cacacacaca 

2161 tatatatata tgtaacttcc aaactttcat atcctggccc gagcagatca gategtctaa 

222 1 gtacttaaaa ecaagegaaa ttctctacac cgcacaaccc aggaccegta gaccccaata 
1 0 228 1 attcagttcg gttagtgtta accccagaaa gcccgattcc gatcccgcct aggttgtctt 

2341 tgccttacgt tgtaactaaa gtatgtgtat tatatataca gcaaatgtat gtataactat 

2401 gtegtategg ttatatgect aacaacatta ttttttgtaa acaacaaaat cgaatatctc 

2461 ggaaaatgtg ttcttataat tatattgatt aatgeaatta caatatattt acaatttacc 

252 1 gttacgtttt tacattatac ataagacgea agagaaggaa acggaagttt aaggattaga 
15 258 1 aagctgaata agaaaaggct taaggacgag ctgagtagca gttaaagtga gegagaaate 

264 1 gaatgaatac cagaaaattt caagcaagca cataaaagta tgcaatattt tgtttaaaaa 

2701 caacttttta ttagtttctt aaatataaca taattacgta catacacaca cgtatatata 

2761 gggctatata tatctatata tatatatata tacatgatag acaaatccca atccggttcc 

282 1 aaggtttagt aaaaataaag agaaataaaa cgaaaaacaa aaacttttga tatgaaatcc 
20 288 1 tacgeataat taacaacttt tattgtttct aagacttaaa cttaattaaa atggaaacca 

2941 aaacagactg acggaccgac cccgacagca tgccacgccc tcccccgccc caccctccac 

3001 agatcctggc agaaatttca aaggagtttg atacacaaat cgagaaaaga aattttcaaa 

3061 aaaataatat aaagacaagc aaaeggegae ttttttggtt gatacatttg aaaagaatat 

3121 acaattaaat atctgactga ctatacaaag aegttacaca cacgcataca catacacaca 
25 3181 catacacgca tacacacaca gcttacgata cataaattag ttaaacttag agtaaacaaa 

3241 caacaacaaa cacattggat agtaggtgat aattggtgtg tcttaaataa accttaaccc 

3301 ctccccgacc cccgcccact tgettaatae ccaacgcccc aaaaagcccc acatttctac 

3361 taaatgaaaa gcttaatcaa aacttttttg aaattattca agtgaaaatt tcagcaggca 

342 1 ggcataaata ttaattaaca ttaattatag caaggaaact tataaataaa atgtatacaa 
30 348 1 caaaactaca aaaattaaat aaattacatt ttgeaaatte cacaaaaaat aaaacatgat 

354 1 tttgeaaatt cacttaaaat cctttccctg aatccaagca aaaatattta cactagctta 

3601 catagaactg ggacgaggac atgaatattt caattgagaa aaaaatctat gttaatgtaa 

3661 tcgatcgatt tggacatatt taagttcgac atttttggee ttacaaaaca aaaaacaaaa 

372 1 agaagaaacc taaagtactt tatatatata caaaccatat atacaatata gagaatacaa 
35 378 1 aactagtttt aatttataca aagcaaggga gcagctttca aactcaaaac aaaaatatcc 

384 1 ccgaaaaaaa caacaacttt gttaaaaact gegcataata aagaaaataa taaacaaagt 

3901 taatctataa tataaattga agttaagttg atttgagegg tcgacaacaa gaacataaat 

3961 gtatctttaa atgatatatg tattgttaaa tttgtatgct aagtttttag aaaggttaca 

402 1 tttttaaaga ataataacaa aagatcgega actcgacaag gtgtaaaatg agtacattta 
40 408 1 aattaaaatt tagcatatat aatgcataaa tattatgtta cgatatttac atttatataa 

4141 aacaaaacaa aaacactaaa gaaaaccgaa aaaacagaag tcccatatta aaaatgaaat 

420 1 aaaatgagca gaacctataa actgataagg gaattctgaa tattaaaaaa aaaaagaaaa 

4261 ca 

45 

7. SEQ ID NO: 7 Accession No. NM_079769 Drosophila melanogaster 
Hormone receptor-like in 96 CG11783-PA 

MSPPKNCAVCGDICALGYNWAVTCESCKAFFRRNALAKKQFTCP 
50 FNQNCDITVVTIU^FCQKCRLRKCLDlGMKSENIMSEEDKLlKRRKIETNRAKJm.MEN 

GTDACDADGGEERDHKAPADSSSSNLDHYSGSQDSQSCGSADSGANGCSGRQASSPGT 
QVNPLQMTAEKJVDQIVSDPDRASQA1NRLMRTQKEAISVMEKVISSQKDALRLVSHL 
IDYPGDALKIISKFNWSPWALTVFTKJMSSPTDGVEIISK1VDSPADVVEFMQNLMH 
SPEDA1D1MN1CFMNTPAEALR1LNRILSGGGANAAQQTADRKPLLDICEPAVKPAAPAE 
55 RADTVIQSMLGNSPPISPHDAAVDLQYHSPGVGEQPSTSSSHPLPY1ANSPDFDLKTF 
MQTNYNDEPSLDSDFS1NSIESVLSEVIRLEYQAFNSIQQAASRVICEEMSYGTQSTYG 
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GCNSAANNSQPHLQQPICAPSTQQLDRELNEAEQMKLRELRLASEALYDPVDEDLSAL 
MMGDDRIKPDDTRHNPKLLQLINLTAVAIKRLIKMAKXITAFRDMCQEDQVALLKGGC 
TEMMIMRSVMIYDDDRAAWKVPHTKENMGN1RTDLLICFAEGNIYEEHQKFITTFDEKW 
RJS1DENIILIMCA1VLFTSARSRVIHKDVIRLEQNSYYYLLRRYLESVYSGCEARNAFI 

5 

KL1Q1CISDVERJLNKFIINVYLNVNPSQVEPLLRE1FDLKNH 

8. SEQ ID NO: 8 Accession No. NM_079769 Drosophila melanogaster 
Hormone receptor-like in 96 CG11783-PA 

10 

I gttattggga ttggcctgga gcactcggac ggacagtaat tcattaaaat atgtggtgat 

61 aacgegaget gecgaatctg cgtgcaattc gtgcgtttga cgtgggtact aactgetatg 

121 ctgtcgcgcg gacagttgtt ctgatacgea gagttcctgc ctcaccacac acgaccacct 

181 ccattaaaac cagccacccc ccccagcgcc tcctccaccg acagcagctg ctccaccgca 
15 24 1 ccaccaggag aggggcaatt aaaaaatcaa teagagggee ctaattgaaa gctgccaccg 

301 tcgaaatgtc gccgccgaag aactgegegg tgtgcgggga caaggctctg ggctacaact 

361 teaatgeggt cacctgcgag agetgeaagg cgttcttccg aeggaacgeg ctggccaaga 

421 agcagttcac clgccccttc aaccaaaact gcgacatcac tgtggtcact cgacgcttct 

481 gecagaaatg ccgcctgcgc aagtgcctgg atategggat gaagagtgaa aacattatgt 
20 541 ccgaggagga caagctgatc aageggegea agatcgagac caaccgggcc aagcgacgcc 

601 tcatggagaa eggcaeggat gcgtgcgacg ccgatggcgg cgaggaaagg gatcacaaag 

66 1 cgccggcgga tagcagcagc agcaaccttg accactactc ggggtcacag gaetegcaga 

721 gctgcggctc ggeggacage ggggccaatg ggtgctccgg cagacaggcc agttcgccgg 

781 gcacacaggt caatccgctt cagatgaegg ccgagaagat agtcgaccag ategtatccg 
25 84 1 acccggatcg agcctcgcag gccatcaacc ggttgatgcg caegcagaaa gaggctatat 

901 cggtgatgga gaaggtaatc agctcacaaa aggaegcett aaggctggtg tegcatttga 

961 tcgactatcc aggegacgea ctcaagatca tttcaaagtt tatgaactcg ccctttaacg 

1021 cgctgacagt attcaccaaa ttcatgagct cacccacgga cggcgttgaa attatctcaa 

1081 agatagttga ttcgcccgcg gacgtggtgg agttcatgca gaacttgatg cactcgccag 
30 1141 aggacgecat cgatataatg aacaagttca tgaatacccc ageggaggeg ctgcgcattc 

1201 ttaaccgaat cctaagcggc ggaggagega acgcagccca gcagacagca gaccgcaagc 

1261 cattgctgga caaggagecg gcggtgaagc ctgcagcgcc ageggagega gctgatactg 

1 321 tcattcaaag catgetggge aacagtccgc caatttcgcc acatgatget gccgtggatc 

1381 tgcagtacca ctcgcccggt gteggggage agcccagtac atcgagtagc caccccttgc 
35 1 44 1 cttacatagc caactcgccg gacttcgatc tgaagacctt catgcagacc aactacaacg 

1 501 acgagcccag tctggacagt gattttagca ttaactcaat egaateggtg ctatccgagg 

1 561 tgatcegcat tgagtaccag gecttcaata gcatacaaca ageggcateg cgegtaaagg 

1 621 aggagatgtc ctacggcact cagtctacgt acggtggatg caattegget gcaaacaata 

1 681 gccagccgca cctgcagcaa cccatctgcg ccccatccac ccagcagttg gatcgegage 
40 1 74 1 taaacgaggc ggagcaaatg aagctgcggg agetgegact ggecagegag gctctttatg 

1 801 atcccgtgga cgaggacctc agcgccctga tgatgggcga tgategcatt aagcccgacg 

1 861 acactcgcca caacccaaag etattgeage tgatcaatct gaeggeggtg gccatcaagc 

1 92 1 ggcttatcaa aatggccaag aagattacag cattccgtga catgtgccag gaggaccagg 

1 981 tggccctact caaaggtggc tgcacagaaa tgatgataat gcgctccgta atgatttacg 
45 2041 acgacgatcg cgccgcctgg aaggtacccc ataccaaaga gaacatgggc aacatacgea 

2101 ctgacctgct caagtttgee gaaggcaata tctacgagga gcaccaaaag ttcatcacaa 

2 161 cgtttgacga gaagtggcgc atggacgaga acataatcct gatcatgtgt gccattgtcc 

2221 tttttacctc ggctcgatcg cgagtgatac acaaagacgt gattagattg gaacagaatt 

2281 cctactatta tettctgega agalatctgg agagtgttta ttctggctgt gaggegagaa 
50 2341 aegegtttat caagctaatc caaaagattt cagatgtgga gegtctgaac aagttcataa 

240 1 ttaatgtcta tttgaatgtt aacccatccc aggtggagcc ettgetgegt gaaatattcg 

2461 atttgaaaaa tcactagaca accgatgegt gtegggcatt taatgectat gttgatgece 

2521 aatgatgaat ggtcaacaag ctgtagttgt tgttgttgtt gatgtctgtt ttatcttgtc 

2581 gcttgtaatg ttagatttta atcgaatgtg attgttagat rtgeatatae tgcalagatt 
55 264 1 ttatatttct acatcaaaga gagcatattt aggataccaa gtgeaaagea acacaatcta 

2701 tatgtaatgt acacegttta cctagtttca aataaactag aegataatge aataactaac 
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2761 ttggaagcgt gggttctgtg caaaaaggaa aaaagacaaa aaaaataaac tgaclttgag 
2S2I aaccagtggtaa 

9. SEQ ED NO: 9 Accession No. NM_057539 Drosophila melanogaster 
5 Hepatocyte nuclear factor 4 CG931 0-PA 

MMKHPQDLSVTDDQQLMKVNKVEKMEQELHDPESESHIMHADAL 

ASAYPAASQPHSPIGLALSPNGGGLGLSNSSNQSSENFALCNGNGNAGSAGGGSASSG 

SNNNNSMFSPN^LSGSGSGTNSSQQQLQQQQQQQSPTVCAICGDRATGKHYGASSCD 

10 GCKGFFRRSVRKNHQYTCI^ARNCVVDKDICRNQCRYCRLRKCF10\GMKKEAVQNERDR 
ISCRRTSNDDPDPGNGLSVISLVKAENESRQSKAGAAMEPNINEDLSNKQFASrNDVC 
ESMKQQLLTLVEWAKQIPAFNELQLDDQVALLRAHAGEHLLLGLSRRSMHLKJDVLLLS 
NNCVITRHCPDPLVSPNLDISRJGARJIDELVTVMKDVGIDDTEFACIKALVFFDPNA 
KGLNEPHRIKSLRHQILNNLEDYISDRQYESRGRFGEILLILPVLQSlTWQMIEQfQF 

15 AK1FGVAHIDSLLQEMLLGGELADNPLPLSPPNQSNDYQSPTHTGNMEGGNQVNSSLD 
SLATSGGPGSHSLDLEVQHIQALIEANSADDSFRAYAASTAAAAAAAVSSSSSAPASV 
APAS1SPPLNSPKSQHQHQQHATHQQQQESSYLDMPVICHYNGSRSGPLPTQHSPQRMH 
PYQRAVASPVEVSSGGGGLGLRNPADITLNEYNRSEGSSAEELLRRTPLICIRAPEMLT 

20 APAGYGTEPCRMTLKQEPETGY 

10. SEQ ID NO: 10 Accession No. NM_057539 Drosophila melanogaster 
Hepatocyte nuclear factor 4 CG9310-PA 

25 1 agttgaattc cagtgacgtt ggaagaaaca actgeaaaag gcaaaaacaa agacaatgtt 

6 1 tataagctgt atattccget ttgattgata taaatgaata tatgcagtgc gecagttata 

1 2 1 caactgccct gcaaaagtca ctcattaaat aaaaaacgee cgagatgaat ttcacagegg 

1 8 1 eggcaacaag tgcaataata gtaaaaaatc aaaagccaaa caacgaaatc tctcccaaaa 

24 1 aaacgaagaa gcgtgtcgcg gtgccaaaaa gaaaacaaaa atagaaaaat acacaacaaa 
30 301 ataataegga gaaacgttaa ttataacgag ccacaaaatc gcataaagaa atcaacaagt 

36 1 gtgtgtctgc ctttttttcc atattcgett teattcatge ggtcaactca acaataacaa 

421 ctcaaaatag caacaacaac aataacaata tcaacaagag cagcagcagt cgctgataaa 

481 agccctgcag ctaaaacaac aacaaaacaa caaagatagt tagaaagaac ategtctgge 

541 cattgagctt taattgeegg tcattacttc attactatgt gattggatct tcccgaccca 
35 601 cttgtaaata aaaagtaaaa atactggtta tgaagcatga tgaagcatcc gcaggatctg 

66 1 agtgtcacgg atgaccagca gttaatgaag gtgaacaagg tggagaagat ggagcaggag 

72 1 ttgcacgacc ccgaatcgga gagecacata atgeacgegg atgccctggc ctctgcctat 

781 ccggctgcct cgcagcccca cagtccgatc ggcctcgccc tcagccccaa tggcggtggg 

84 1 ctgggactga gcaacagtag caaccagagc agegagaact ttgcgctctg caaeggaaac 
40 901 ggaaatgegg geagegcagg aggeggaagt gecagcagtg gcagcaacaa caacaacagc 

961 atgttctcac ccaacaacaa ettgagegga ageggaagtg ggactaacag cagtcagcag 

1021 caattgeage agcaacaaca acagcaatca ccgacggtct gcgccatttg tggagategg 

1081 gegaegggea aacattatgg agcctccagc tgegaegget gcaaaggatt cttcaggagg 

1141 agtgtcagga aaaatcatca gtacacttgc agatttgege gaaactgcgt tgtggacaag 
45 1201 gaeaaaegga atcagtgccg ctactgccgg ctgaggaagt gcttcaaggc gggcatgaag 

1 26 1 aaggaggegg tgeaaaaega gegggatege attagctgee gccgcacctc caatgacgac 

1321 ccggatccgg gcaatgggct gtctgtgatt tccttggtta aggeggagaa tgagtcgcgt 

1 38 1 cagtcgaagg caggcgctgc catggageca aacattaacg aggacctctc caacaagcag 

1441 ttcgegagea tcaacgatgt ctgegagteg atgaagcagc agetgetgae cctggtggaa 
50 1 501 tgggctaagc agattcegge ctttaacgag ctgcagctgg atgaccaggt ggcactgeta 

1 561 cgcgcccatg ctggcgagca tttgctcctc ggcctgtctc gtcgttcgat gcacttgaag 

1 62 1 gaigttctcc tgctgagcaa caattgtgtg atcacaaggc acigtccaga tccccttgtg 

1 68 1 tegecgaatt tggacatctc ccggatcggc gcccgtatca tcgatgaact ggtgacggtc 

1 74 1 atgaaggatg tgggtatcga tgacactgaa ttegcttgea tcaaggccct agtcttcttc 
55 1 80 1 gatcccaatg ccaagggtct taatgaaccg catcgcatca aatcgctacg gcatcagata 
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1861 ctcaataatc tcgaggacta catatcagat eggcaatacg agtcgegegg tegcutgge 
192 1 gagattctgc tcatcctgcc ggttctgcag tctattacct ggcagatgat cgagcagatc 
1981 cagmgeca agatctttgg agtggcccac attgancai lactgeagga aatgttgug 
204 1 ggaggagagt tggccgacaa tcctctgccg ctatcgccgc ccaatcagtc aaatgactac 
2101 cagagtccca cccacacagg caacatggag ggeggtaate aagttaactc ctctctggac 
2161 tcgctggcca cgtccggtgg tcctggctcg catagtctgg acctggaggt gcagcacatt 
222 1 caggctctta tegaggegaa cagtgeggat gattccttcc gggcctacgc ggccagcact 
2281 geageggcag ccgctgcagc cgtctcgtcc tcctcctctg cacccgcatc cgttgctcca 
2341 gcctcgatct ctcctccgct caacagcccc aagtcacaac atcaacatca geaacatgeg 
240 ) acgcatcagc aacaacagga gagctcctac ttggacatgc ccgtcaagca ctacaatggc 
2461 agtcggtccg gaccgctgcc aacacagcac agtccccaga ggatgeatec ctaccaaaga 
252 1 gcagtcgcct cgccggtcga agtgtccagc gggggcggcg gattgggtct gcgcaatcct 
2581 gecgatatta cgctcaacga gtacaacegg agegagggta geagtgeega ggagctgctg 
2641 cgacgaactc cactgaagat ccgggctccc gagatgetaa ccgcacccgc tggttatgga 
2701 acggaaccct gtegcatgae acttaaacag gagecagaga ctggttacta gaagaataac 
2761 gaacggtgca atatgeagtt tgcaatagga caccccttaa gcacacaacc catacacata 
2821 caggccctct ettgetgtae tccccaccaa gtgetatata gagatgaaat tgaaatgaag 
2881 aacttactta attgttatgc cttgaaccat tttgatactt tttattagtc ctaagtaggt 
294 1 attttggaaa ttgttgctta atttttaatg tttaacgcag ttgeaatata tttttggagt 
3001 catattttgc tcaagaagtt tattatatac aattatacta tatatataca ccatttagca 
3061 tgtactgagt ttgttggtta tttggttatc ttatacttgt gcgtggatca caaaacattc 
3121 atataaggee atgeaatata ttgttttagg ttagggtgtt gtctagatta tgctgaaagt 
3181 gtaatatata tttaatttta aacaaagaac tatttttata tgaatatgta taatatacaa 
3241 actatttc 

11. SEQ ID NO: 11 Accession No. NM_176065 Drosophila melanogaster 
Hormone receptor-like in 38 CG1864-PC 

MDEDCFPPLSGGWSASPPAPSQLQQLHTLQSQAQMSHPNSSNNS 
S>WAGNSHNNSGGYNYHGHFNAINASA>n^SPSSSASSLYEYNGVSAAD 
QQSYQQHNYNSHNGERYSLPTFPTISELAAATAAVEAAAAATVSSPSVGGPPPVRRAS 
LPVQRTVSPAGSTAQSPKLAKJTLNQRHSHAHAHALQLNSAPNSAASSPASADLQAGR 
LLQAPSQLCAVCGDTAACQHYGVRTCEGCKGFFKRTVQKGSKYVCLAD1CNCPVD1CRRR 
NRCQFCRFQKCLVVGMVKEVVRTDSLKGRRGRLPSKPKSPQESPPSPPISLITALVRS 
HVDTTPDPSCLDYSHYEEQSMSEADKVQQFYQLLTSSVDVIKQFAEKJPGYFDLLPED 
.QELLFQSASLELFVLRLAYRAR[DDTKLIFCNGTVLHRTQCLRSFGEWLNDIMEFSRS 
LHNLEIDISAFACLCALTLITERHGLREPKXVEQLQMKIIGSLRDHVTYNAEAQKKQH 

YFSRLLGKLPELRSLSVQGLQRIFYLKLEDLVPAPALIENMFVTTLPF 
40 

12. SEQ ID NO: 12 Accession No. NM_1 76065 Drosophila melanogaster 
Hormone receptor-like in 38 CG1864-PC 

1 ctcgcccatt ggagggcccc tgtcctgtgg cagcagcttg cccagcttcc aggagaccta 
45 61 ctccttgaag tacaacagca geageggtag cagcccccag caggcgtcct cctcctccac 

121 cgccgccccc acgcccactg accaggtget gaccctcaag atggacgagg actgcttccc 

1 8 1 gcctctgtcc ggcggctgga gtgccagtcc gcccgccccc tcccagctcc ageagctgea 

241 caccctgcag tctcaggccc agatgtcgea tcccaacagc agcaacaaca gcagcaacaa 

301 cgcgggcaac agccacaaca acagtggggg ctacaactac cacggccact teaatgecat 
50 361 caatgccagc gecaatctgt cgcccagctc ctcggccagt tccctctacg aatataatgg 

42 1 tgtttccgca geggacaact tetaeggaca acagcagcag cagcaacagc aaagctatca 

48 1 gcaacataac tacaactcgc acaatggega gcgttactcg ctgcccacgt ttcccacgat 

54 1 ttcggagctg gctgcggcca ctgctgctgt egaagctgeg gcggcggcca caglctcctc 

60 1 cccttcggtg ggcggtccgc cgccagtacg ccgagcatcg ctgccggttc agegaacegt 
55 66 1 ttcgccagcc ggctccacgg cgcagagccc caagctggcc aagatcacac tgaaccagcg 
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721 gcactcccai gcccatgccc atgccctaca gctcaactcggcacccaan eggeggcaag 
781 ttcgccagcg agtgcggatc tgcaggcggg ccgttigctc caggctccgi cgcagctgtg 
841 tgccgtttgtggcgacaccgccgcclgcca gcattatgga gtgegaaect gcgagggatg 
90 1 caagggattc ttcaagcgga ccgtgcagaa gggctccaag tatgtctgee tageggacaa 
5 96 1 gaattgcccg gtggacaaga ggcgccgcaa ccgttgccag ttctgccggt tccagaagtg 

1 02 J cctggtcgta ggcatggtca aggaagtggt gcgcacggac tegttgaagg gtcgccgcgg 
1 08 1 gagactgccc tcaaaaccga aatcgcccca ggagtcgeca ccatcaccac ccatctcgtt 
1141 gatcacggcc ctggttcgca gecatgtega cacgactccg gatccctcgt gectggacta 
1201 cagccactat gaggagcagt egatgagega ggcagataag gtgcaacagt tttaccagct 

10 1261 gctgaccagc tccgtggacg tgatcaagca gttcgecgag aagattcccg gctacttcga 
1321 tctcctgccg gaggatcagg agetgetett ccagagcgca tegctggaac tgttcgtcct 
1381 gcggctggcc tatcgcgcca ggatcgatga caeca age tg atettctgea aeggcaeggt 
1441 gctccaccgc acccagtgcc tgcgctcctt cggcgagtgg ctcaacgaca tcatggagtt 
1 501 cagccgcagc ctgcacaacc tggagatcga catctccgcc ttcgcctgcc tctgtgccct 

15 1561 aaccctgatc acagaacgcc atggcctgcg ggagccgaag aaggtggagc agctccagat 

1 62 1 gaagatcatt ggcagtctgc gcgaccacgt cacctacaat gccgaggccc agaagaagca 
1681 gcactacnc agccgcctgc tgggcaagct geeggagctg aggtccctga gtgtccaggg 
1 74 1 actgeagagg atcttctacc tgaagctgga ggacctggtg cccgcgccag ctctcatcga 
1 80 1 gaacatgttc gtcaccacat tgcccttcta gaggegatea teaagegtat catcacaact 

20 1861 tgcttcctta aactagcccc taagttatgc ctcctaggat atacagagaa aggaccccat 

1 92 1 aggaeggacg caactagctt tagtagaacc ctgaaataaa taaatctcac aacagcaaaa 
1 98 1 acaaaaccga accgaacaga aatgaagega atagcagacc caggecatat ctttagtgta 
204 1 gagctaggta gttagccgga cagccccggc tccttcgata attaeggaca tgcatatttg 
2101 agagggggtt tccagtgcac agectatgge tcctgcgtga ctcgtcagca ccgcgagctc 

25 2161 caacttgttg acgttaattg ttaaattgtt taatttcaac tgtcaaaacc ggaatcaacg 

222 1 gccgggcacg caatggcaac actttctatc cccggacttc gaagectget caacattegg 
228 1 cactacggac ggacaaacaa eggacagaaa cagaactcac tcttgctctc ttgccttttg 
234 1 ctaacttcta gtcaattgat ttaggcgaat caaataaata aataaataaa ataagggegt 
240 1 gcagcagtag tgttatataa tttctatgee agaccccagc ggttctcttc aaggaaatcc 

30 246 1 cccaatgagt tgcacaaatt gggataaagt aegatagect attattctta tatttctttt 

252 1 aaaagctcga agatagatga gaactgtgtg gaaatccact atcatatcat atagttgeta 
2581 taagccgtgc ttgccctaag ctaagttaga cccgcataaa gttgatagee caaccaagta 
2641 tttcggttat ttcctagact aaggtcctaa tagttatagg ctaagactat tctgttcgat 
270 1 ttatcaatgc accaaacagt gcacaatgag agtataagta ccttcttgtg atgattgtgt 

35 2761 ctgacacaga gagagttgea cacaagcaca caaactagcc gataagttac taaatacgat 
2821 ctaatatcta atatatataa tataatataa tatatataag tccaagtatt eggaaatcca 
288 1 agaacccttg cataaccgca gttcgtacgt tccaaacgag aaaagaactt tatttaatcc 
294 1 tagaccactc catctaagtt ctcaaagaat cgtatgtgga tcgttggatc tgtctctcta 
300 1 tatatgtgtg tgtgttatct cgatagaaaa cccctctatg tgattttgtg atagattggc 

40 3061 attgaactct atatatttat atatatatgt ctataatata tatacacgea taaatatata 

3121 tttttatgtc taacttttgt atggtttatt ttataegtae cacttttctt tgataacaaa 
3181 aagtaaaaaa ctegttagat agcaaatatt tcaaaggtat gttacgagga cttttcaaag 
324 1 taccagtctt tagegacttt ccaattaacg ttcgtattaa cgaaagacag attttctatg 
3301 tgttaaattg aagacttcta taactataac taaatgcaag ctaagagcaa aaacacaaat 

45 3361 ccacaaatcc ccaaagtgaa taacatatct cttcaagctt tegagtgeae ggaacacgta 

342 1 gaaccgaaac ccaagtgtta ctaaatccat ttaataatcg geaagceggg ggcgtcggcg 
348 1 tggttaatac gttctcatta cctatacaat ttagatagat cattattaaa ttattgtaca 
3541 tgtagcacat gaaatgttcg acaactagat tttgtaccat cttaaagaag aacctaggcc 
3601 aagctaaact aagtataaac tatgatctgc atgcggctga gctgtagcta tgagaaatat 

50 366 1 acctgcgtgg atctaagtga aatgggacac tttgaattta gatatgaaac gttctaaacg 
372 1 egaegtacta actctcccaa ctgcgaactc taccaattaa gagaaattcc cagaaaatgt 
378 1 gtcaggattt caaagcgtcc catctcactt gaacccaccc aatcaacaaa tacaaatcct 
384 1 agggaagttg agaggttcag caaccataga gcaatatttc ataagaaaac gcaccttaaa 
3901 ttaccgaaaa acatagatta acctgatctt gtaacgtttg ggagcgataa taagccagga 

55 3961 ttaaacagga acagttaggt gaccaaatca gttcgaaacg agatgataga taggttcggg 

402 1 ttcgaaaccc taaacgegat gecattttag ccgttacaac attggatatc aaccatgcac 
408 1 atgaatatga atatgaatat gaatattata gagatatatc tagctatagg aacctacttt 
4141 gtacctacac gacatggaaa catcaaacct acatgeatat ttacacacat atattttgaa 
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420! tagagegacg acmtacaa gtigcgtaca aagctatagc tatagcttga tatggecate 

4261 ecagagegag catatacata tatutgggt tattgttctt ttgtaatttt ataaatgeat 

4321 acatatttat tgtactacgt gaatgtcaag tgtggattca taimtgag atacagctac 

438 ) aaaacgaaac aaaagaaaat aaaacaaaac agaagagtaa aegtgaaatt tttcgatgaa 
5 444 1 acaattttaa atgagaactt tttaatattg ctattaaagg atatacatat acacactaac 

4501 atacatatat atmactat gtaaeggata gaattaagct agatgeageg cataaagctt 

456 ) tatacaacaa attgaaaagc aacagaagaa attggcacaa attaaattta tatagcataa 

462 1 ttagaegtec ttegcaagat aatgttattc gtaataagag egtcaategg tacateggge 

468 1 gctatttccc actacacccc caaccacaca atagataacc taagctatgt atgtacatta 
10 4741 gctatgtata tccagcccac ttatgegect actactagaa atgcagaaag cagaaagaga 

4801 ggtgaaacct atagaegcta tcacaaatgt ctatctgata gaeateggta ctaccaatgc 

4861 tatattgeca gttgtgtaat ttactcttat ttgatcgttt catttaccag ttaagaaccc 

492 1 aaatcatata agtgttatga tggaagaact ataacttgea attcaattaa etctgeaata 

498 1 cgataacaag caaagegaat catttcattt cgatttaatc tttaattata tatacttaaa 
1 5 504 1 cgatgtaagc ccaaaacaaa cgttttttct atatctgtct tttgagcaaa ttagttatac 

5101 gcaaaaccaa acegtattta cataaatgta tacaaaacaa ategtatatt ttcattggtt 

5161 tgaaataaat acataaaaca a 

13. SEQ ID NO: 13 Accession No. NM_141390 Drosophila melanogaster 
20 CG10296-PA 

MSNFSACAVCGDQSSGKHYGVSCCDGCSCFFKRSVRRGSSYACI 

ALVGNCVVDKARRNWCPSCRFQRCLAVGMNAAAVQEERGPRNQQVALYRTGRRQAPPS 
QAAPSPTPHSQALHFQ1LAQ1LVTCLRQAKANEQFALLDRCQQDAIFQVVWSE1FVLR 
25 ASHWSLDISAMIDGCGDEQLKRLICEAHQLRADVLELNFMESLILCRKELAINAEYAV 
ILGSHSKAAL1SLARYTLQQSNYLRFGQLLLGLRQLCLRRFDCALSCMFRSWRDILK 

TL 

30 14. SEQ ID NO: 14 Accession No. NIVM41390 Drosophila melanogaster 

CG10296-PA 

1 atgtcgaact tcagtgcctg cgcagtgtgc ggegatcaga getcegggaa gcactacggc 

61 gtgtcctgct gcgatgggtg ctcctgcttt ttcaagcgga gcgtgcggcg egggagcage 
35 121 tacgcctgca tcgctctggt egggaactgt gtggtggaca aggegeggeg gaactggtgt 

1 8 1 ccctcctgcc gcttccagcg atgcctggcc gtgggaatga acgctgctgc ggttcaggag 

24 1 gagegeggtc cgcgcaacca gcaggtggct ctctaccgca ctggccggag acaagctccg 

301 ccatctcagg cggcgccatc cccgacgccc cactcccagg cgctgcactt ccagatcctc 

361 gcccagatcc ttgtcacgtg cctgcgccag gegaaggeca acgagcagtt cgctctgltg 
40 42 1 gatcgctgcc aacaagaege catctttcag gtggtgtgga gegagatett cgtcctgcga 

481 gcgtcccact ggtctctgga catcagcgcc atgatcgacg getgeggega tgagcagctc 

541 aaaeggctea tttgegagge ccaccagcta agggecgacg tcctggaact caactttatg 

601 gagtccctaa tcctgtgcag aaaagaattg gecatcaatg eggagtatge cgttatcctg 

66 1 ggaagecact ctaaagccgc cctgatctcc ttagcccgct acaccctgca gcaatccaac 
45 72 1 tacctgeggt teggacaact gctccttggt ctgaggcagc tgtgcctgag gcgcttcgac 

78 1 tgcgcgcttt cttgtatgtt tegcagegtg gtcagggaca tcttaaaaac actttag 

15. SEQ ID NO: 15 Accession No. NM_169459 Drosophila melanogaster 
seven up CG11502-PC 



50 



MGMRREAVQRGRVPPTQPGLAGMHGQYQIANGDPMGIAGFNGHS 

YLSSYISLLLRAEPYPTSRYGQCMQPNNIMGIDNICELAARLLFSAVEWAK.NIPFFPE 

LQVTDQVALLRLVWSELFVLNASQCSMPLHVAPLLAAAGLHASPMAADRWAFMDHIR 
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IFQEQVEKLKALHVDSAEYSCLICAIVLFTTDACGLSDVTHIESLQEKSQCALEEYCRT 
QYPNQPTRFGKLLLRLPSLRTVSSQVIEQLFFVRLVGKTPIETL1RDMLLSGNSFSWP 

YLPSM 



5 



16. SEQ ID NO: 16 Accession No. NMJ 69459 Drosophila melanogaster 
seven up CG11502-PC 

I ctaaattgtt gttttcaaaa gaaatgaatt tctttccact cctttcagaa ttcaagaata 
10 61 aatattgaag caatatggct tcccttgttc aaaccgatca atcgttgcaa atctttcttc 

121 aagegctegg tgcgacgtaa tctaacttac tcttgccgcg gcagcagaaa ctgtcccata 

1 8 1 gatcaacacc ategcaatea atgtcaatat tgtcgattga agaagtgect caaaatgggc 

241 atgagacgcg aagctgttca acgtggacgc glaccaccca ctcagcccgg tctggccggc 

301 atgcatgggc agtaccagat tgccaacggg gatcccatgg geattgeegg etttaaeggg 
15 361 cactcgtacc tcagttccta catctcgctc ctgctgcggg eggaacegta tccgacttcg 

421 egatatggee agtgcatgca acccaacaac attatgggca tcgacaacat ctgcgaactg 

48 1 gccgcccgac tgctcttctc ggeggtcgag tgggecaaga acataccctt cttcccggag 

54 1 ctgcaggtga ccgaccaggt ggccctgctc cggctcgtct ggtcagagct cttcgtccta 

601 aacgccagcc agtgctccat gccgctccat gtggcgccac tgctggccgc cgccggactt 
20 661 catgcctccc cgatggccgc cgatcgtglg gtggccttca tggaccacat ccgcatcttc 

72 1 caggagcagg tggagaagct gaaggcgctg catgtcgact ccgcggagta ctcctgcctc 

781 aaggegateg tgctcttcac caccgatgcc tgcggcctgt ccgatgtgac gcacattgaa 

841 tccctgcaag agaagtcgea gtgcgccctc gaggaatact gccggaccca gtatcccaac 

901 cagcccacga gatteggcaa gctgcttctc agactgecat cgctgcgaac ggtctcctca 
25 96 1 caagtcattg agcaattgtt ttttgtgcgt etagteggaa aaacgccaat tgaaacgctg 

102 1 atacgegata tgetgetgag eggcaacagt ttctcctggc cctatctgcc ttcgatgtga 

108 1 cacacgatgt ggegecaatt gacaacaact tgatcategg ccgcagctgt ggeggctgea 

1 141 acgctcaaca teaattcegg eggaggegge ateggcateg gcggcggggg cagtggcagt 

1201 ggcggtggcg gtagtggagg cggtggcgga gtcgttggat gtggcagcca caacgttgtc 
30 1261 gctgccagtc atgaccagct cgccaatgtt gctgtcatgc agcaaacata cggcagcggc 

1321 ggcagcagca gcagcagcat cagcggttgc cacaaeggta acaaeggcag cggcggcagc 

138 1 atttgeaate agcagatcaa caactacggc aacaacagca acaacaatgt eggcaatcat 

144 1 atgagtgcag gcagtttttt cggtgggtcc aacaacagca tccacagtag tggcaatagc 

1 50 1 aataccgatt atatgaccac gccagccacc gettatgega caccagcgac agcagccaca 
35 1561 tccacggtga acaccacaac gatgetgtet aattactgcg atgccgccac catgatgatg 

1 62 1 gccgctgctg cagtcaatgc aaatcaatgc ctgcagcaac atcaccagcg catgttgctc 

1 68 1 gegggcagea gcaacagcag cagcaacaac agcagcagca acagcaaegg cgcagcagca 

1 74 1 atgccctcct catcctcgtc tggctcactg tcatctgcct catcgacccc aacagcaaca 

1 80 1 geaactgega ctgcaattgc aacagcaaca geaactgeag cagcaacagc cgcgcagcaa 
40 1861 caacagcaac aatcgccgcc aaatttaatc gatatcagcg aagttcctct cattgtggat 

1 92 1 gtcaagtagt gtaattattt atgeatctag aaatggggct ataaaccaac cttgtagata 

1 98 1 ccccgccccg cccccaccac taccacaaaa accataaaac cccaaaaaaa aaacaattga 

204 1 aaaatgtaaa aaaaaaaagt tggaggatga gcgccgcgta gcttaattga ctaattttcc 

2101 atttgtagct tttgttgtaa ctttgtacat aactcctcga aaaattcaag tttttctcta 
45 2161 ggccacccca gctgtgagca aaaccaatct cagctgacat atccaagaga acttcaaaag 

222 1 tgaagccccc aaaaaaagta agaaggegee aaaaaaacgt ctttacatat gaatgtgtat 

228 1 aatatttaaa tggcactgag ttctacttaa ttttagacca caaacacttg aaaaaatcaa 

234 1 tgaaaaaata agaattgtgg aaagagaaaa atccccccta acactttcaa aagacaaaac 

2401 ataaagatag ttaaaatatt tatatatgta atgtagcata tacaegtata tagtacatat 
50 2461 atgaatatat aaacgaaact ctactcccag tggtttgcag aaatatacca aaaattttaa 

252 1 gctatgttta cttgatgtgt ggcaattttt atgtgtgctt tagcaatttt atttttactt 

258 1 taagtaaaat ttaaaattta taaacattcg attctcgact ggtttttctc ggcggatgta 

264 1 tctcaaagat gcttctgtat gggaaggccg aattgttgaa atacgaatgc aaaatttagc 

2701 gaatttttta tttagtaacc attacgagta aaaacacaaa atgttcagtg caagtttcag 
55 276 1 ttcttaaacg amtttcgt aagcttaagc attatcrtat ttatgtgtat agagtatgaa 

282 1 aagttttcta tattttgtaa taataaaaat ttgcgtttat aatgaa 
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17. SEQ ID NO: 17 Accession No, NMJ)79857 Drosophila melanogaster 
tailless CG1378-PA (til) mRNA 

5 MQSSEGSPDMMDQKYNSVRLSPAASSRILYHVPCKVCRDHSSGK 

HYGIYACDGCAGFFKRSJRRSRQYVCKSQKQGLCVVDKTHRNQCRACRLRKCFEVGMN 
IO)AVQHERGPRNSTLRR^MAMYKJ)AMMGAGEMPQIPAEILMNTAALTGFPGVPMPMPG 
LPQRAGHHPAHMAAFQPPPSAAAVLDLSVPRVPHHPVHQGHHGFFSPTAAYMNALATR 
A LPPTPPLMA AEHI ICETA A EH LFKN VNWIKSV RA FTELPM PDQLLLLEES WICEFFILA 
10 MAQYLMPMNFAQLLFVYESENANREIMGMVTREVHAFQEVLNQLCHLNIDSTEYECLR 
AISLFRKSPPSASSTEDLANSSILTGSGSPNSSASAESRGLLESGKVAAMHNDARSAL 
HNYIQRTHPSQPMRFQTLLGVVQLMHKVSSFTIEELFFRKTIGDITIVRLISDMYSQRKJ 

18. SEQ ID NO: 18 Accession No. NMJ)79857 Drosophila melanogaster 
1 5 tailless CG1378-PA (til) mRNA 

] gagtccacat eggagtaace aaggatatat cgaatatatc acacaatccg caataccgcc 
61 gtccacccaa accgttaaaa caaaaatcca aaacgactca aagatacacc agtgccaagt 

1 2 1 gaaattcaat ttgtgcaagc gtttctacaa aaatcgccaa aattacgccc cacateggta 
20 181 tgcagtcgtc ggagggttca ccagacatga tggatcagaa atacaacagc gtgcgtcttt 

24 1 cgccagcggc ategagtege attctatacc atgtgccctg caaagtctgc agagatcaca 

301 gctccggcaa geattaegge atctacgcct gtgatggctg cgccggattc ttcaagagga 

361 geatteggag atcccggcag tatgtgtgca agtcgcagaa gcagggactc tgtgtggtgg 

42 1 acaagacgea caggaaccaa tgtagggctt gecgactgag gaagtgcttt gaggteggaa 
25 48 1 tgaacaagga tgcagtgcag cacgageggg gaecgeggaa ctccactctg cgtcgccaca 

541 tggccatgta caaggatgee atgatgggcg ceggegagat gecacaaata cccgccgaaa 

601 ttctgatgaa cacggctgcc ttgacegget ttcctggagt accgatgccc atgcctggcc 

661 tgccccagag ggctggtcat catcctgctc acatggctgc cttccagccg ccaccatcgg 

72 1 ctgccgctgt cttggactta tccgtgccac gagtgcccca tcacccggtg caccaaggac 
30 781 accacggttt cttctcgccc accgccgcct acatgaatgc cctggccact cgggccctgc 

84 1 cccccactcc teegctgatg gcagctgagc acatcaagga aaccgcggcg gaacacctat 

901 teaagaaegt caactggatc aagagegtae gggccttcac cgaactgccc atgeeggate 

961 agctgctcct gctggaggag tcctggaagg agttcttcat cctggccatg gcccagtacc 

1021 taatgeccat gaatttcgee cagctgetgt tegtctaega gtccgagaat gccaaccggg 
35 1 08 1 agatcatggg catggtgacc cgcgaggtgc acgccttcca ggaggtgctg aaccaactgt 

1 141 gecatctgaa caltgacagc accgagtacg agtgtctgag ggctatttcg ctcttccgta 

1 20 1 agtcaccacc gtcggcaagt tctaccgagg atttagecaa cagctcaatc ctgacaggaa 

1 26 1 gcggcagccc gaactcctcg gcctctgctg aatccagggg tcttctggag tegggaaaag 

1 32 1 tggeggecat gcacaacgat geceggagtg cgctgcacaa ctacatccag aggacccatc 
40 1381 cctcgcagcc catgegatte cagacgctct tgggcgtggt gcagctgatg cacaagglct 

1441 caagcttcac catcgaggag ctgttcttcc gaaagaccat cggcgacatc accattgtgc 

1 50 1 gcctcatctc cgacatgtac agtcagegea agatctgaaa agtatgtaga gectagacta 

1561 atcgccgcac tegaagtgee ttccaagtgc tgggaactgt gataatctcg gaagaagege 

1621 tttggacaat actcgatcag tgaaatcaac gatttctcat atccaggagt egagecttaa 
45 1 68 1 aataegtaca caacactcac cttaatacct tacctaaaca gaactcgaag taatcttagc 

1741 taaagtctct cagaccatcc agatgtgttt caaattgeat tegcaaaagt ttcaactttg 

1801 cctgttaaat aegtcaateg tagttttaaa cactttagtt ttaagegcat attattagct 

1 86 1 ttaggatttg gaaaaataat tattc 

50 19. SEQ ED NO:19 Accession No. NM_057792 Drosophila melanogaster 

dissatisfaction CG9019-PA 

MGTAGDRLLDI PCK VCG DRSSG KH YG 1 YSCDGCSGFFKRSI H RN 
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RIYTCKATGDLKGRCPVDKTHRNQCRACRLA KCFQSAMNKDAVQHERGPRKPKLHPQL 
HHHHHHAAAAAAAAHHAAAAHHHHHHHHHAHAAAAHHAAVAAAAASGLHHHHHAMPVS 
LVTNVSASFNYTQHISTHPPAPAAPPSGFHLTASGAQQGPAPPAGHLHHGGAGHQHAT 
AFHHPGHGHALPAPHGGVVSNPGGNSSA1SGSGPGSTLPFPSHLLHHMLIAEAAS1CLP 
5 GITATAVAAVVSSTSTPYASAAQTSSPSSNNHNYSSPSPSNSIQSISSIGSRSGGGEE 
GLSLGSESPRVNVETETPSPSNSPPLSAGS1SPAPTLTTSSGSPQHRQMSRHSLSEAT 
TPPSH ASLM ICASNNNl^WWh^ 

ASSSSGFLELLLSPDKCQEL1QYQVQHNTLLFPQQLLDSRLLSWEMLQETTARLLFMA 
VRWVKCLMPFQTLSKNDQHLLLQESWICELFLLNLAQWTIPLDLTPILESPLIRERVLQ 
10 DEATQTEMKTIQEILCRFRQITPDGSEVGCMKAIALFA PETAGLCDVQPVEMLQDQAQ 
CILSDHVRLRYPRQATRFGRLLLLLPSLRTIRAATIEALFFKETIGNVPIARLLRDMY 

TMEPAQVDK 

1 5 20. SEQ ID NO:20 Accession No. NM J)57792 Drosophila melanogaster 

dissatisfaction CG9019-PA 



1 gtcagcccag gcgatccgca tttgcgtccg cagcaggttt ccgatttcag aactctgatt 
61 ccagcggcag egaatcgegt eggcatctga acatttgaaa ataatctaaa attgeaagtg 

20 121 actttgtgca ccggttacac taaaattgtt aacaaatege catatattct gaatttaaat 

181 ttaaagtgcg cagtgcggaa tataaatcag agcaaactgg ataegttagg gttcaaatac 
24 1 ttccatcaac ggaaaatggg cacagcgggc gatcgectgt tggacattcc ctgcaaggtg 
30 1 tgtggcgatc gcagctccgg caagcactat ggaatctaca getgegatgg ctgctccggt 
361 tttttcaagc ggagcattca tegcaategg atttacacct gtaaggecac cggcgatctc 

25 42 1 aagggtcget gtccggtgga caagacccat eggaatcagt gtcgcgcctg tcgcctggcc 

481 aagtgcttcc agteggecat gaacaaggat gctgtgcagc aegagegegg tcctaggaaa 
541 cccaagttgc acccgcaact gcatcatcat catcatcatg ctgctgccgc cgccgctgca 
601 gcgcatcatg cagcagccgc ccatcaccat caccatcatc accaccacgc ccacgcagcg 
661 gccgcccatc atgcggcagt ggctgcagcg gctgcctccg ggctgeatea ccaccaccac 

30 72 1 gccatgcccg tctcgctggt gaccaatgtc tcggcctcgt tcaactatac gcagcacatc 

781 tccacgcatc cgcctgctcc ggcggcgcca cccagtggct ttcacctgac ggccagtggc 
841 gcccagcagg gaccagctcc accagctggc cacctgcacc atggtggagc eggacatcag 
901 cacgccacgg ccttccacca teegggacat ggacacgcgc tgcctgcccc acatggegge 
961 gtegtcagea atcccggcgg caactcgagc gcaatctccg gcagcggtcc cggctccacg 

35 1 02 1 ctgcccttcc cctcgcacct gctgcaccac aatctgatag eggaggegge cagcaagctg 

1081 ccgggcatca ctgccacagc cgttgcggcg gtggtgtcct ccactagcac gccctacgcc 
1141 tcggcggccc agaegtegtc gectagtage aacaaccaca actactcctc gccctcgccc 
1201 agcaactcca tccagtccat ctcgagcatt ggatcgegea gcggtggtgg cgaggagggc 
1261 ctcagcctgg geagegagag tccgcgcgtc aatgtggaaa cggagacacc ttcgccatcg 

40 1321 aactcgccgc cccttagtgc tggtagcatt tcgccagcgc ccacgttgac cacctcgtcg 

1381 ggatcgccgc agcaccgcca gatgtcgegg cacagcctca gtgaggcaac cacgccgccc 
1441 agccacgcct ctctcatgat ttgcgccagc aacaataaca ataacaacaa taataataac 
1 50 1 aataatggag agcacaagca gtcgagctac acatceggat caccgacacc cacaacgccc 
1561 acgccgccac cgccgcgttc tggtgtaggt tccacctgca acacggccag cagctccagc 

45 1 62 1 ggcttcctgg agetgetget cagtccggac aagtgccagg agctcatcca gtaccaggtg 

1 68 1 cagcacaaca cgctgctctt cccgcaacag ctgttggact cgcggctgct ctcctgggag 
1741 atgetgeagg agacgaegge gcgactgctc ttcatggcgg tgcgctgggt caagtgcctc 
1801 atgcccttcc agacgctctc caagaacgac cagcatttgc tgctccagga atcctggaag 
1 86 1 gagctcttcc tgctcaacct cgcccaatgg actataccgc tggatctaac gcccatactg 

50 1921 gaatcaccgc tcatccgcga acgggtgctg caggacgagg ccacacaaac ggagatgaag 

1 98 1 acgatccagg agatcctctg ccgcttccgc cagatcacac ccgacggcag cgaggtgggc 
2041 tgcatgaagg ccatcgccct gttcgcaccc gaaaccgccg gcctgtgcga cgtgcagccg 
2101 gtggagatgl tgeaggatea ggcgcagtgc atcctctccg accatgtgcg actgcgctac 
2161 cctcgccaag caacccgctt eggcaggctg ctgctcctgc tgccctcgct gcgcaccatc 

55 222 1 cgggcggcca ccatcgaggc gctgttcttc aaggagacca teggcaatgt gcccattgct 

2281 cgactgctgc gcgacatgia caccatggaa ccggcacagg tggacaagtg aaccggccac 
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2341 gcatgacagt cgaaaigaaa tcaaaatcga ttccctagca cctaagcgcc acccatcggi 
240 1 egtegtcata tgegaactta tttgtaitcc aatgcgaccc gaatcctatt cagattcact 
2461 geggcaggag gcggtccaaa tgtggggcgg aagctgeaga tgctatggtt cgcaggacgc 
2521 catgtaatgg aggegtatgt actaaccgcg ctcctccatt ggcgatgcag tecgegatga 
5 2581 tggcgcactc ccacacccac acccgtaccc acaccttgat ttategcegg caatgegteg 
264 1 gagtctcctt actttcgett cgttttctaa catttgtatc cttattttat ttcatctttt 
2701 tecaeggatt tttcgttttg actgcctggg cggcactctt tatttatctt tcattcgacg 
2761 ttttgtcgtc gcttttctaa aaattcccca tgttatttca acctggcaag gacctcgcag 
2821 tcccattccc gcgcccttac ttacaaatca cttcccatcc cacatccagc aattccgtgg 
10 2881 tttgaattct ttcgtgcatt gactacgaaa taccctttaa tcagacaaat aaagaatatt 
2941 agttgtaatt cttttttctg caatccagct etaaaaeggg tttcttaatc gaaatcgata 
3001 aatgtaaaaa ttatacatat cctttaccaa cattgtttgc eta 

21. SEQ ID NO: 21 NM_1 66092 Drosophila melanogaster CG36801-PA 

15 

MATGRSLLFRVPWYVCLCVCAESAEPGVYWRLRLRLGLPTLAGP 
HTNTLTLTARTSSCRSIKXERIKASQQANAPPELPLKVSVDVNHIAAHSQRRR1GLV 
RFHQRESEDRPLAVASPRLQINMEPTAMNPKICLHSPQRHCYTPPPAPMHGQAPPPTST 
GVAPPTQPPPPHPAAPNVPNGRLLSWNHSAAAAAAAAAAQAAANSMNHSSAAEGSSMT 
20 RIKGQNLGLICVVCGDTSSGKHYGILACNGCSGFFKRSVRRKLIYRCQAGTGRCVVDK 
AHRNQCQACRLKXCLQMGMNKDDDSlDVTNDNEEPHAVSRSDSSFnslPQFMSPNLYTH 
QHETVYETSARLLFMAVKWAKNLPSFARLSFRDQVILLEESWSELFLLNAIQWCIPLD 
PTGCALFSVAEHCNNLENNANGDTCITlCEELAADVRTLHEfFCKYKAVLVDPAEFACL 
KAIVLFRPETRGLKDPAQIENLQDQAHHTKTQFTAQIARFGRLLLMLPLLRMISSHK1 

25 

ESIYFQRTIGNTPMEKVLCDMYKN 

22. SEQ ID NO: 22 NMJ. 66092 Drosophila melanogaster CG16801-PA 

30 1 atggcgaccg ggcgttctct gctctttcga gtgccttggt atgtgtgctt gtgtgtgtgc 

61 geagagageg cagagceggg tgtttattgg agattgegat tgcggcttgg cttacccaca 

1 2 1 ctcgcagggc cgcacaccaa cacactaaca etaacagega ggacaagctc ctgccgcagc 

181 atcaagaagg aacgaatcaa ageaagecaa caagcaaatg cgccaccaga gttgecacta 

241 aaagtctccg ttgacgttaa catcatcatc gcggcacact cgcagcgccg teggategga 
35 301 ttggttcggt neatcageg ggaatcagag gaccgtccac ttgccgtcgc ctctccacga 

361 ttgeaaatta atatggagee tactgegatg aacccgaaaa aactccacag tccgcagcgg 

421 cattgetaca ctccgccgcc ggcgccgatg caeggacagg cgcctccacc tacatcaacg 

48 1 ggcgtggccc cgcccacaca gccaccgccc cctcatcccg ccgccccaaa cgtgcccaat 

541 ggtcgattgc tgagctggaa tcacagtgcc getgeagctg ctgcggcggc ggcagcccaa 
40 601 gcggcagcca actccatgaa ccactcglcg geggeggagg gttcatcgat gaeceggatt 

661 aagggtcaga acctgggcct catctgegtg gtgtgcggcg acaccagctc gggaaagcac 

72 1 taeggaatec tagectgeaa tggctgctcc ggattcttca aacgeagegt gcggcggaaa 

781 ctcatttatc gctgccaggc gggaacggga cgctgtgtgg tggacaaagc teateggaat 

84 1 caatgecagg cctgcaggct caagaagtgc cttcaaatgg gaatgaacaa ggacgacgac 
45 901 tccatagatg taaccaacga caacgaggag ccgcatgcag tcagcagatc ggattcgagt 

961 ttcattatgc cgcagttcat gtcgcccaat ctgtacaccc atcaacacga aacagtttac 

102 1 gagacaagtg cccggctgct cttcatggcc gtcaagtggg ccaagaacct gcccagcttt 

1081 gcaagacttt ecttteggga tcaggtaatt ttgctggagg agtcctgglc ggagctgttc 

1141 ctgctgaacg caatccaatg gtgcattccc ctggatccca ccggctgcgc cctcttctcg 
50 1 20 1 gtggcggagc actgeaataa tctagagaac aatgccaatg gcgacacttg cataacaaag 

1261 gaggagctgg eggeggatgt gcgaacgctc cacgagatct tetgeaaata caaggeggtg 

132 1 ctggtggacc cegctgaatt cgcgtgcctc aaggegatag ttctcttccg geeggaaacg 

138 1 cgeggactta aagatcegge gcagatagag aatcttcagg ateaggegea ccacacaaag 

144 1 aegcagttea ccgcccagat agecagatte ggacgactcc ttctcatget gccgttgctg 
55 1 50 1 egcatgatea gcicccacaa gattgagtcc atctattttc agegcactat tgggaacacg 

1561 cccatggaaa aggtgctctg tgacatgtat aagaactag 
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23. SEQ ID NO: 23 Accession No. NM_J68258 Drosophila melanogaster 
estrogen-related receptor CG7404-PA (ERR) 

5 MSDGVSILHIKQEVDTPSASCFSPSSKSTATQSGTNGLKSSPSV 

SPERQLCSSTTSLSCDLHNVSLSNDGDSLKGSGTSGGNGGGGGGGTSGGNATNASAGA 
GSGSVRDELRRLCLVCGDVASGFHYGVASCEACKAFFKRTIQGN1EYTCPANNECEIN 
KKRRKACQACRFQKCLLMGMLKEGVRLDRVRGGRQKYRRNPVSNSYQTMQLLYQSNTT 
SLCDVKILEVLNSYEPDALSVQTPPPQVHTTS1TNDEASSSSGS1KLESSVVTPNGTC 
10 IFQ^WN^WDPNEll^VLSDIYDICELVSVlGWAKQIPGFIDLPLNDQMKJLLQVSWAEIL 

TLQLTFRSLPFNGKLCFATDVWMDEHLAK£CGYTEFYYHCVQLAQRMERJSPRREEYY 
LLKALLLANCD1LLDDQSSLRAFRDTILNSLNDVVYLLRHSSAVSHQQQLLLLLPSLR 

Q A DDI LRRFWRGI A RDE V ITM KKLFLEM LE PLA R 

15 

24. SEQ ID NO: 24 Accession No. NM_168258 Drosophila melanogaster 
estrogen-related receptor CG7404-PA (ERR) 

I ccctggtcag gtctggttca ccaaaaaaga aaataaaatt acatttcaat ctttccaata 
20 61 tgeaaatate tgcacgaaaa ecagegagaa cagcatgctc acaataaaga gcccccaaac 

1 2 1 aatgtgactc gtatccgcgc agagtgacgt ttcgtgcctt gcccgagtgc caaatccaaa 

1 8 1 tcccaatcca ggcgcacaaa ategatgeag atgctgtctg cattctcata gaaagtgcaa 

24 1 ctgaataacc gatggtcgee aaaagccacg atgtccagta ataatgacca gtgaataaac 

301 aattatgact cgagcatcga aaaatgctga ggaacgaata cataagcaat aacaagaagg 
25 361 tgctcaactc ggaccaaaac aagtactaca tgctaacggt egaggaggee gatatgtatt 

421 gacgttgtta cagtggagct gattacacaa aagatcctca gaacgatttt atccaaggca 

481 cgaacatgtc cgacggcgtc ageatcttge acatcaaaca ggaggtggac actccatcgg 

541 cgtcctgctt tagtcccagc tccaagtcaa cggccacgca gagtggcaca aacggcctga 

601 aatcctcgcc cteggttteg ceggaaagge agetctgeag ctcgacgacc tctctatcct 
30 661 gegatttgea caatgtatcc ttaagcaatg atggcgatag tctgaaagga agtggtacaa 

721 gtggcggcaa tggcggagga ggaggtggtg gtacgagtgg tggaaatgcg accaatgega 

781 gtgeeggage tggatcggga teegtcaggg acgagctccg ccgattgtgt ttggtttgtg 

84 1 gcgatgtggc cagtggattc cactatggtg tggcgagttg tgaggcttgc aaagcgttct 

901 ttaaaegcac catccaaggc aacatcgagt acacgtgtcc ggegaacaac gagtgtgaga 
35 961 ttaacaagcg gagaegcaag gcctgccaag cgtgtcgctt ccagaaatgt ctactaatgg 

102 1 geatgetcaa ggagggtgtg cgcttggatc gagttcgtgg aggaeggcag aagtaccgaa 

1 08 1 ggaatcctgt atcaaactct taccagacta tgeagctget ataccaatcc aacaccacct 

1141 cgctgtgcga tgtcaagata ctggaggtgc tcaattcata tgagceggat gecttgageg 

1201 tccaaacgcc gccgccgcaa gtccacacga ctagcataac taatgatgag gcctcatcct 
40 1261 cctcgggcag cataaaactg gagtccagcg ttgttacgcc caatgggact tgeattttec 

132 1 aaaacaacaa caacaatgat cccaatgaga tactaagegt ccttagtgat atttacgaca 

1381 aggaattggt cagegtcatt ggctgggcca agcagatacc tggctttata gatctgccac 

1 44 1 ttaacgacca gatgaagctt ctccaggtgt cgtgggcaga gatcctgacg ctccagctga 

1501 ccttccggtc cctaccgttc aatggcaagt tatgettege cacggatgtc tggatggatg 
45 1 56 1 aacatttggc caaggagtgc ggttacacgg agttctacta ccactgcgtc cagatcgcac 

1 62 1 agcgcatgga aagaatatcg ccacgaaggg aggagtacta ettgetaaag gcgctcctgc 

168 1 tggccaactg cgacattctg ctggatgatc agagttccct gcgcgcattt cgtgatacga 

174 ! ttcttaattc tctaaacgat gtggtctact tgetgegtea ttcgtcggcc gtgtcgcatc 

1801 agcaacaatt getgettttg ctgccttcgc tgeggcagge ggatgatatc ctgegaagat 
50 1 86 1 tttggcgtgg aattgeaege gatgaagtca ttaccatgaa gaaactgttc etcgagatge 

1 92 1 tcgagccgct ggccaggtga aaaggattat gcgggcgccc aaactagttg atctagctga 

] 98 1 taagcaaagg tgcaaatata gtcttaggta tatatggatg tatactagag tagattaagc 

204 1 gtaggataag ccatgtatat aaatagtaaa atacttgtcg ggtaagatta gttcgcagaa 

2101 aaaatctctt ttaatggact accaactaca gcaactggaa aaccctactt atcttctaga 
55 2161 aicggggtgt gcltacactg gttaaaggcg catataggtg ttatgtgtct aaagttgtga 
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222 1 gtcacagatc ucaaiaatt tgttcaauc tcactggitc tgatatatgi atatgecgea 
2281 accitctgat gtaacgtatg aatllgtggg caemtaaa atacgalagt ggttctacaa 
2341 tacaatggat tatactgUt claagtgtca tgtaacccag igattctgtg ictatgtggt 
2401 acacatgegg tcaaaagaat agcaatgtcg teegtgaata ataaacegtt tgtaactgtt 
5 246 1 gtttccatac tccctaagtt ctgtattctt tggggatttt cttttcctaa acaaattcaa 
2521 attagtttt 

25. SEQ ID NO: 25 Accession No. NM_168908 Drosopfaila melanogaster 
Hormone-receptor-like in 78 CG7199-PC 

10 

MDGVKVETFIKSEENRAMPLIGGGSASGGTPLPGGGVCMGAGAS 
ATLSVELCLVCGDRASGRHYGAISCEGCKGFF1CRSIRKQLGYQCRGAMNCEVTKHHRN 
RCQFCRLQKCLASGMRSDSVQHERKPrVDRKEGIIAAAGSSSTSGGGNGSSTYLSGKS 
GYQQGRGKGHSVKAESAATPPVHSAPATAFNLNENIFPMGLNFAELTQTLMFATQQQQ 

15 QQQQQHQQSGSYSPD1PKADPEDDEDDSMDNSSTLCLQLLANSASNNNSQHLNFNAGE 
VPTALPTTSTMGL1QSSLDMRVIHICGLQLLQPIQNQLERNGNLSVKPECDSEAEDSGT 
EDAVDAELEHMELDFECGGNRSGGSDFAINEAVFEQDLLTDVQCAFHVQPPTLVHSYL 
NIHYVCETGSRIIFLTIHTLRKVPVFEQLEAHTQVKLLRGVWPALMAIALAQCQGQLS 
VPTIIGQFIQSTRQLADIDJC1EPLKJSKMANLTRTLHDFVQELQSLDVTDMEFGLLRL 

20 ILLFNPTLLQQRKERSLRGYVRRVQLYALSSLRRQGGIGGGEERFNVLVARLLPLSSL 

DAEAMEELFFANLVGQMQMDALFPFILMTSNTSGL 

26. SEQ ID NO: 26 Accession No. NM_1 68908 Drosophila melanogaster 
25 Hormone-receptor-like in 78 CG7199-PC 

1 attggaacaa ggagatttta ttgcgttaga aaaggttcaa aataggcaca aagtgcctga 

61 aaatatcgta actgacegga agtaacataa ctttaaccaa gtgcctcgaa aaatagatgt 

121 ttttaaaagc tcaagaatgg tgataacaga cgtccaataa gaattttcaa agagecaaat 
30 181 gtttgggttt cagttattta tacagccgac gactattttt tagccgcctg ctgtggcgac 

24 1 aatggacggc gttaaggttg agaegttcat caaaagegaa gaaaaccgag cgatgccctt 

301 gateggagga ggcagtgcct caggcggcac tcctctgcca ggaggcggcg tgggaatggg 

3 6 1 ageeggagea tccgcaacgt tgagcgtgga gctgtgtttg gtgtgcgggg accgcgcctc 

421 cgggcggcac taeggageca taagctgega aggctgeaag ggattcttca agegctcgat 
35 481 ccggaagcag ctgggctacc agtgtcgcgg ggctatgaac tgegaggtea ccaagcacca 

541 caggaategg tgccagttct gtcgactaca gaagtgcctg gccagcggca tgcgaagtga 

60 1 ttctgtgcag cacgagagga aaccgattgt ggacaggaag gaggggatca tcgctgctgc 

661 eggtagctea tccacttctg gcggcggtaa tggctcgtcc acctacctat ccggcaagtc 

72 1 eggctatcag caggggegtg gcaaggggca cagtgtaaag gccgaatccg cggccacgcc 
40 78 1 tccagtgcac agcgcgccag caacggcctt caatttgaat gagaatatat tcccgatggg 

841 tttgaatttc gcagaactaa cgcagacatt gatgttcgct acccaacagc agcagcaaca 

901 acagcaacag catcaacaga gtggtagcta ttcgecagat anccgaagg cagatcccga* 

961 ggatgacgag gacgactcaa tggacaacag cagcacgctg tgcttgcagt tgctcgccaa 

1 02 1 cagcgccagc aacaacaact cgcagcacct gaactttaat gctggggaag tacccaccgc 
45 1 08 1 tctgcctacc acctcgacaa tggggcttat tcagagttcg ctggacatgc gggtcatcca 

1141 caagggactg cagatcctgc agcccatcca aaaccaactg gagegaaatg gtaatctgag 

120 1 tgtgaagccc gagtgegatt cagaggegga ggacagtggc accgaggatg ccgtagacgc, 

1 26 1 ggagctggag cacatggaac tagactttga gtgcggtggg aaccgaagcg gtggaagcga 

1321 ttttgetate aatgaggegg tctttgaaca ggatcttctc accgatgtgc agtgtgcctt 
50 1381 tcatgtgcaa ccgccgactt tgglccactc gtatttaaat attcattatg tgtgtgagac 

144 1 gggctcgega atcatttttc tcaccatcca tacccttcga aaggttccag ttttcgaaca 

1501 attggaagee catacacagg tgaaactcct gagaggagtg tggecagcat taatggctat 

1561 agctrtggcg cagtgtcagg gtcagctttc gglgcccacc attateggge agtttattca 

162 1 aagcactcgc cagctagegg atatcgataa gatcgaaccg ttgaagatct cgaagatggc 
55 1 68 1 aaatctcacc aggaccctgc acgactttgt ccaggagctc cagtcactgg atgttactga 
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1 74 1 tatggagm ggcngctgc gtctgatctt gctcttcaat ccaacgctct tgcagcagcg 
1 80 1 caaggagegg tegngegag gctacgtccg cagagtccaa ctctacgctc tgtcaagttt 
1 86 1 gagaaggcag ggtggcatcg geggeggega ggagegem aaigttctgg tggctcgcct 
1 92 1 tcttccgctc ageagectgg aegcagagge catggaggag ctgllcttcg ccaacttggt 
5 1 98 1 ggggcagatg cagatggatg ctcttattcc gttcataclg atgaccagca acaccagtgg 

204 1 actgtaggcg gaattgagaa gaacagggcg caagcagatt cgctagactg cccaaaagca 
2101 agactgaaga tggaccaagt gegggcaata catgtagcaa ctaggcaaat cccattaatt 
2161 atatatttaa tatatacaat atatagttta ggatacaata ttclaacata aaaccatggg 
2221 tttattgttg ttcacagata aaatggaatc gatttcccaa taaaagcgaa tatgttttta 
10 2281 aacagaat 

27. SEQ ID NO: 27 Accession No. NM_057433 Drosophila melanogaster 
ultraspiracle CG4380-PA (usp) 

15 MDNCDQDASFRLSHIKEEVKPDISQLNDSKNSSFSPKAESPVPF 

MQAMSMVHVLPGSTMSASSNNNSAGDAQMAQAPNSAGGSAAAAVQQQYPPNHPLSGSKH 
LCSICGDRASGKHYGVYSCEGCKGFFKRTVRKDLTYACRENRNC11DKRQRNRCQYCR 
YQKCLTCGMKREAVQEERQRGARNAAGRLSASGGGSSGPGSVGGSSSQGGGGGGGVSG 
GMGSGNGSDDFMTNSVSRDFS1ER1IEAEQRAETQCGDRALTFLRVGPYSTVQPDYKG 

20 AVSALCQVVT^KQLFQMVEYARMMPHFAQVPLDDQVILLKAAWIELLIANVAWCSIVSL 
DDGGAGGGGGGLGHDGSFERRSPGLQPQQLFLNQSFSYHRNSA1KAGVSA1FDRILSE 
LS VKM KRLNLDRRELSCLKA 1 1 LYNPDI RG IKSRA EI EMCREK V Y ACLDEHCRLEHPG 
DDGRFAQLLLRLPALRSISLKCQDHLFLFRITSDRPLEELFLEQLEAPPPPGLAMKLE 

25 28. SEQ ID NO: 28 Accession No. NM_057433 Drosophila melanogaster 

ultraspiracle CG4380-PA (usp) 

1 aaaaatgtcg aegegaaaaa aggtatttat tcattagtca gaaagtctgg cattctttgt 

61 ttgttggtaa aaagcgcaat tgtttggagg egagegaata aagtgcgctg ctccatcggc 
30 121 tcaagattat gtaaatgcag caacgacccc accaacaacg aaactgeaac ctgctccact 

1 8 1 tggcccaacg gaccaatagc ggaeggaegg acacggtggc gttggcaaag tgaaacccca 

241 acagagaggc gaaagegage caagacacac cacatacaca cgaagagaac gagcaagaag 

30 1 aaaceggtag geggaggagg cgctgccccc agttcctcca atatacccag caccacatca 

361 caagcccagg atggacaact gcgaccagga cgccagcttt cggctgagcc acatcaagga 
35 42 1 ggaggtcaag ccggacatct cgcagctgaa cgacagcaac aacagcagct tttcgcccaa 

48 1 ggccgagagt cccgtgccct teatgeagge catgtccatg gtccacgtgc tgcccggctc 

541 caactccgcc agctccaaca acaacagcgc tggagatgee caaatggege aggcgcccaa 

601 ttcggctgga ggctctgccg ccgctgcagt ccagcagcag tatccgccta accatccgct 

661 gageggcage aagcacctct gctctatttg eggggategg gccagtggca ageactaegg 
40 72 1 cgtgtacagc tgtgagggct gcaagggctt etttaaaege acagtgegea aggatctcac 

78 1 ataegcttge agggagaacc geaactgeat catagacaag eggcagagga accgctgcca 

84 1 gtactgccgc taccagaagt gcctaacctg eggcatgaag cgegaagegg tccaggagga 

901 gcgtcaacgc ggcgcccgca atgcggcggg taggctcagc gecageggag geggcagtag 

96 1 eggtccaggt teggtaggeg gatccagctc teaaggegga ggaggaggag gcggcgtttc 
45 1 02 1 tggcggaatg ggcageggea acggttctga tgacrtcatg accaatagcg tgtccaggga 

1 08 1 tttctcgatc gagegcatea lagaggcega geagegageg gagacccaat geggegateg 

1141 tgcactgacg ttcctgcgcg ttggtcccta ttccacagtc cagccggact acaagggtgc 

1 20 1 cgtgtcggcc ctgtgccaag tggtcaacaa acagctcttc cagatggtcg aatacgcgcg 

1 26 1 catgatgecg cactttgccc aggtgecget ggacgaccag gtgattctgc tgaaagcege 
50 1 32 1 ttggatcgag ctgctcattg cgaacgtggc ctggtgcagc ategtttege tggatgaegg 

1 38 1 cggtgccggc ggcgggggcg gtggactagg ccacgatggc tcctttgagc gacgatcacc 

1441 gggccttcag ccccagcagc tgttcctcaa ccagagcttc tcgtaccatc gcaacagtgc 

1 50 1 gatcaaagee ggtgtgtcag ccatcttcga ccgcatattg teggagctga gtgtaaagat 

1561 gaagcggclg aatctcgacc gaegegaget gtcctgcttg aaggecatea lactgtacaa 
55 1 62 1 cccggacata cgegggatea agagceggge ggagatcgag atgtgccgcg agaaggtgta 
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1681 cgcttgcctg gacgagcact gccgcclgga acaiccgggc gacgatggac getttgegea 
1741 actgetgetg cgtclgcccg etttgegate gatcagectg aagtgccagg alcacctgtt 
1801 cctcttccgc attaccagcg accggccgct ggaggagctc tttctcgagc agclggag'gc 
1 861 gccgccgcca cccggcctgg cgatgaaact ggagtagggt cccgactcta aagtctcccc 
5 1 92 1 cgttctccat ccgaaaaatg tttcattgtg attgcgtttg tttgeattte tcctctctat 

1981 cccttatacc ctacaaaagc cccctaatat tacgeaaaat gtgtatgtaa ttgtttattt 
204 1 tttttttatt acctaatatt attattatta ttgatataga aaatgttttc cttaagatga 
2101 agattagect cctcgacgtt tatgtcccag taaacgaaaa acaaacaaaa tccaaaactt 
2161 gaaaagaaca caaaacacga acgagaaaat gcacacaagc aaagtaaaag laaaagttaa 
10 222 1 actaaagcta aacgagtaaa gatattaaaa taacggttaa aattaatgea tagttatgat 
2281 etacagaegt atgtaaacat acaaattcag cataaatata tatgtcagca ggegcatate 
2341 tgcggtgctg gccccgttct aaatcaattg taattacttt ttaacataaa tttacccaaa 
240 1 acgttatcaa ttagatgega gatacaaaaa tcaccgacga aaaccaacaa aatatatcta 
246 1 tgtataaaaa atataaactg cataacaa 

15 

29. SEQ ID NO: 29 Accession No. NM_168757 Drosophila melanogaster 
Ecdysone-induced protein 75B CG8127-PD 

MGEELPILKGlLKGm^YHNAPVRFGRVPKJlEKARlLAAMQQST 
20 QNRGQQRALATELDDQPRLLAAVLRAHLETCEFTKEKVSAMRQRARDCPSYSMPTLLA 
CPLNPAPELQSEQEFSQRFAHV1RGVIDFAGM1PGFQLLTQDDICFTLLKAGLFDALFV 
RL1CMFDSS1NS11CLNGQVMIUU)AIQNGANARFLVDSTFWAERMNSMNLTDAE1GL 
FCAIVLITPDRPGLRNLELIEFCMYSRLKGCLQYIVAQNRPDQPEFLAKJLLETMPDLRT 
LSTLHTEKXVVFRTEHK^LLRQQMWSMEDGNNSDGQQNKSPSGSWADAMDVEAAKSPL 
25 GSVSSTESADLDYGSPSSSQPQGVSLPSPPQQQPSALASSAPLLAATLSGGCPLRNRA 
NSGSSGDSGAAEMDIVGSHAHLTQNGLTITPIVRHQQQQQQQQQIGILNNAHSRNLNG 
GHAMCQQQQQHPQLHHHLTAGAARYRKLDSPTDSGIESGNEKNECKAVSSGGSSSCSS 
PRSSVDDALDCSDAAANHNQVVQHPQLSVVSVSPVRSPQPSTSSHLKRQ1VEDMPVLK 
RVLQAPPLYDTNSLMDEAYKPHKKFRALRHREFETAEADASSSTSGSNSLSAGSPRQS 
30 PVPNSVATPPPSAASAAAGNPAQSQLHMHLTRSSPKASMASSHSVLAKSLMAEPRMTP 
EQMKRSDIIQNYLKRENSTAASSTTNGVGNRSPSSSSTPPPSAVQNQQRWGSSSV1TT- 
TCQQRQQSVSPHSNGSSSSSSSSSSSSSSSSSTSSNCSSSSASSCQYFQSPHSTSNGT 
SAPASSSSGSNSATPLLELQVDIADSAQPLNLSKXSPTPPPSICLHALVAAANAVQRYP 

35 TLSADVTVTASNGGPPSAAASPAPSSSPPASVGSPNPGLSAAVHKV^4LEA 

30. SEQ ID NO: 30 Accession No. NM_1 68757 Drosophila melanogaster 
Ecdysone-induced protein 75B CG8127-PD 

40 1 agtcaccgtc geagtegcag cagttgaggl tcgctctcct egattteggg caaatccgat 

61 accatatagc acagcgtacc gcactctggg tatattegta aegegctttg gcttttacag 

1 2 1 ttagtcgcgt tcgagacctt gtcgagtttt gtcatgttag ccagcgatcc gcgggatccg 

181 aaataageca agaatcacaa cgcgagtgcg geagttgeca gcagtaacta caccaatatt 

241 tatattaatt aaaataaatt aaatgaaaca acatgetgat taatgccaat gaatgttaaa 
45 301 tgcaattgtt aatgtgaaga aaagtcgacc aagtctcccc aaaacaacac ttattcaaca 

361 tccactacac actcgccttt ctggattacg cgcccaaaaa aaaacaaaaa ttaaaaatta 

421 aaccaaacca acaactaatt tatttgetaa atattccaaa aancaatca atgtgaaaag 

481 caagcaaaca aagncctct cacaacaaaa cagcagttaa ttaaaatatc taaccgagat 

54 1 aaagtgcaaa gaagataaca agtttctcaa gcaaacatcc atatglacct gagtaccaac 
50 601 caaaaagctg tgtgtgtgcc aaaaaccgaa gaggaattat ccaaaaatat ttaatgagca 

661 agctcaactg agtggttgat gtgcccccca agggaaaagt gaccaagtca agatattttg 

72 1 tcaaatcgaa cacagaaaac acaaaaatgg gcgaagaact cccgatattg aagggcatac 

78 1 ttaaaggcaa cgtcaactat cacaatgcgc ctgtgcgttt tggacgcgtg ccgaagcgcg 

841 aaaaggcgcg tatcctggcg gccatgcaac agagcaccca gaatcgegge cagcagegag 
55 901 ccctcgccac cgagctggat gaccagccac gcctcctcgc cgccgtgctg cgcgcccacc 
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96 1 tcgagacctg tgagtlcacc aaggagaagg teteggegat geggcagegg gcgcgggait 

1 02 1 gcccctccta ctccatgccc acacttctgg cctgtccgct gaaccccgcc cctgaactgc 

1 08 1 aaicggagca ggagnctcg cagegmeg cccacgtaat tegeggegtg atcgactttg 

1141 ceggcatgat tcccggcttc cagctgctca cccaggacga taagttcacg ctcctgaagg 
5 1 201 egggactett cgacgccctg tttgtgcgcc tgatctgeat gtttgactcg tcgataaact 

1261 caatcatctg tctaaatggc caggtgatgc gaegggatge gatccagaac ggagccaatg 

1 32 1 cccgcttcct ggtggactcc acencaatt tegeggageg catgaactcg atgaacctga 

1 38 1 cagatgeega gataggectg ttctgcgcca tegttctgat tacgeeggat cgccccggtt 

1 44 1 tgcgcaacct ggagctgatc gagaagatgt actcgcgact caagggctgc ctgeagtaca 
10 1 50 1 ttgtcgccca gaataggece gatcagcccg agttcctggc caagttgctg gagacgatgc 

1561 ccgatctgcg caccctgagc accctgcaca ccgagaaact ggtagttttc cgcaccgagc 

1621 acaaggagct gctgcgccag cagatgtggt ccatggagga cggcaacaac agegatggee 

168 1 agcagaacaa gtcgccctcg ggcagctggg eggatgecat ggacgtggag geggecaaga 

1 74 1 gtccgcttgg cteggtateg agcactgagt ccgccgacct ggactaegge agtccgagca 
15 1 80 1 gttcgcagcc acagggegtg tctctgccct cgccgcctca gcaacagccc teggctctgg 

1 86 1 ccagctcggc tcctctgctg gcggccaccc tetceggagg atgtcccctg cgcaaccggg 

1 92 ! ccaattccgg ctccagcggt gaetceggag cagctgagat ggatatcgtt ggctcgcacg 

1 98 1 cacatctcac ecagaaeggg ctgacaatca cgccgattgt gcgacaccag cagcagcaac 

204 1 aacagcagca geagategga atactcaata atgegcatte ccgcaacttg aatgggggac 
20 2101 acgcgatgtg ccagcaacag cagcagcacc cacaactgca ccaccacttg acagceggag 

2161 ctgcccgcta cagaaagcta gattcgccca eggatteggg cattgagtcg ggcaacgaga 

222 1 agaacgagtg caaggeggtg agttcggggg gaagttcctc gtgctccagt ccgcgttcca 

2281 gtgtggatga tgcgctggac tgcagcgatg ccgccgccaa tcacaatcag gtggtgcagc 

234 1 atccgcagct gagtgtggtg tccgtgtcac cagttcgctc gccccagccc tccaccagca 
25 240 1 gecatctgaa gcgacagatt gtggaggata tgcccgtgct gaagcgcgtg ctgcaggctc 

2461 cccctctgta cgataccaac tegctgatgg aegaggecta caagccgcac aagaaattcc 

252 1 gggccctgcg geatcgegag ttcgagaccg ccgaggcgga tgccagcagt tccacttccg 

258 1 gctcgaacag cctgagtgcc ggcagtccgc gacagagtcc agtcccgaac agtgtggcca 

264 1 cgcccccgcc atcggcggcc agcgccgccg caggtaatcc cgcccagagc cagctgcaca 
30 270 1 tgcacctgac ccgcagcagc cccaaggcct cgatggccag ctcgcactcg gtgctggcca 

2761 agtctctcat ggccgagccg cgcatgacgc ccgagcagat gaagegcage gatattatcc 

282 1 aaaactactt gaagegegag aacagcacag cagccagcag caccaccaat ggcgtgggca 

288 1 accgcagtcc cagcagcagc tccacaccgc cgccatcggc ggtccagaat cagcagegtt 

294 1 ggggcagcag cteggtgate accaccacct gccagcagcg ccagcagtcc gtgtcgccgc 
35 300 1 acagcaaegg ttccagctcc agttcgagct ctagctccag ctccagrtcg tcatcctcct 

306 1 ccacatcctc caactgcagc tccagctcgg ccagcagctg ccagtatttc cagtcgccgc 

3121 actccaccag caacggcacc agtgcaccgg cgagctccag ttegggateg aacagcgcca 

3181 cgcccctgct ggaactgcag gtggacattg ctgactcggc gcagcctctc aatttgtcca 

3241 agaaatcgee cacgccgccg cccagcaagc tgcacgctct ggtggccgcc gccaatgccg 
40 330 1 ttcaaaggta tcccacattg tccgccgacg tcacagtgac agcctccaat ggcggtcctc 

3361 cgtcggcggc ggcgagtccg gcgcccagca gcagtccgcc ggcgagtgtg ggctccccca 

3421 atccgggcct gagcgccgcc gtgeacaagg taatgctgga ggegtaagag egggaggagg 

348 1 taggtggttt tacgeggaga agtgggagag acagagactg ggagtggcag ttcagcgaag 

3541 caggaagcag gatcacttgg ageggeggga gttgaattaa attattttac catttaattg 
45 3601 agacgtgtac aaagtttgaa agcaaaacca acatgeatge aatttaaaac taatatttaa 

3661 agcaacaaca aacaaaacaa ctacaagtta ttaatttaaa aaacaaacaa acaaacaaac 

3721 aacaaaaaac ccaagcttga atggtattac 

31. SEQ ID NO: 31 Accession No. NM J 68892 Drosophila melanogaster 
50 Ecdysone-induced protein 78C CG18023-PBEip78C) 

MHPSHLQQQQQQHLLQQQQQQQHQPQLQQHHQLQQQPHVSGVRV 
KTPSTPQTPQMCSlASSPSELGGCNSANNNNNN^mNSSSGNASGGSGVSVGVVVVGGH 
QQLVGGSMVGMAGMGTDAHQVGMCHDGLAGTANELTVYOV1MCVSQAHRLNCSYTEEL 
55 TRELMRRPVTVPQNG1ASTVAESLEFQKJWLWQQFSARVTPGVQR1VEFAKRVPGFCD 
FTQDDQL1LIICLGFFEVWLTHVARLINEATLTLDDGAYLTRQQLE1LYDSDFVNALLN 
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FANTLNAYGLSDTEIGLFSAMVLLASDRAGLSEPKVIGRARELVAEALRVQILRSRAG 
SPQALQLMPALEAKIPELRSLGAKHFSHLDWLRMNWTKLRLPPLFAEIFDIPKADDEL 

32. SEQ ID NO: 32 Accession No. NMJ 68892 Drosophila melanogaster 
5 Ecdysone-induced protein 78C CG18023-PBEip78C) 

1 aagcattaac gaaagaactg cgcacaaagt agggaggcaa taattacata tgtacatggc 
61 tgggaaaggc cttaactaaa cttagcaaac taataaatag aaaaaaggaa atattggeca 

121 aatattatag tattgggaat attaggttac ttgatatcaa aaattaatgt ctattttata 
10 181 cacttattct tagacttaat gttaacttat cgtacttatt atgattggtt tttcaagatt 

241 accagaactt gatagattgg tctagctttt gaaateggat agcattttct ttaaaggact 

301 ttgccatatg etaaagecta acttcttttt tcaattcagc cacagctgac aaaagcgaag 

361 aaaatttgaa agaccgtgaa tccttttgaa acgccctctc cggattcctc attaagtgea 

42 1 aaagatataa categcagag atttcccata aaaatgctga tcaggcgccc tcgcaggttg 
IS 481 ccaacgtcga tttccgccag caggacgatg atgaagatga tggatgecca tctcaccgat 

54 1 tcgatccgag caacatggat gtataccaaa tagagctgga ggaacaggca caaatccgct 

601 ccaaactgct ggtcgaaacc tgtgtgaagc actcgtcttc ggagcagcag cagctccaag 

66 1 ttaagcagga ggacctcatc aaggatttca ctegggaega ggaggaacag ecaagegaag 

721 aggaggegga ggaagaggac aacgaagagg acgaggaaga agaaggegaa gaagaagagg 
20 781 aggacgagga egaggaagee ctgctgccgg tagtcaattt taatgcaaat tcagacttta 

84 1 atttgeattt ctttgacaca ceggaggact cgtccaccca aggggectae agtgaggcca 

901 atagcrtgga atccgagcag gaagaggaga agcaaacaca gcagcatcag cagcagaagc 

961 agcatcaccg ggatttggag gattgectaa gtgccattga agctgatcca ttgcagttgt 

1 02 1 tgeattgega cgacttctat agaacatcag ccctagcaga gagtgttgca gecagtctaa 
25 1081 gcccacagca gcagcagcaa cggcagcaca cccaccagca acaacagcaa cagcagcagc 

1141 agcagcaaca ccctggacag cagcaacatc agctcaactg cacgctgagc aatggtggag 

1 201 gtgctttgta caccatcagc agtgtgcatc agttcggtcc ggccagcaac cacaacacca 

1261 gcagcagctc cccctcctcc agcgccgccc actcttcgcc ggacagegge tgetegtegg 

1 321 cctcctcctc eggatctteg cgatcctgcg gatcctcctc tgcatcctcc tcctcgtcag 
30 1381 eggtcagcag caccatcagc agcggccgca gcagcaacaa cagcgtcgtc aaccccgcag 

1441 caacatcttc atctgttgcg catctgaaca aagagcaaca gcagcagcca ctgccgacga 

1 501 cacagctgca acagcagcag cagcaccagc ageagttgea acacccgcag cagcagcaat 

1 561 ettttggect agcagacagc agcagcagca aeggcagcag caacaacaac aacggtgtct 

1 62 1 cctcgaaatc atttgtgccc tgeaaagtet gtggcgacaa ggcatcggga taccactatg 
35 1 68 1 gtgtaacctc ctgcgagggt tgcaagggat tetttegteg cagtatccag aagcaaatcg 

1 741 aatatcgctg tttgegggae ggcaagtgcc tggtcatcag actgaaccgc aatcgctgcc 

1 801 agtactgccg cttcaagaaa tgcctttccg ctggcatgag ccgcgattcc gtacgttatg 

1 861 gtcgcgttcc caagcgttcc cgtgagctga aeggagegge cgcctcctcc gccgccgctg 

1 92 1 gagctcctgc ctccctcaat gtggatgact ctaccagcag cacactgcac ccgagtcacc 
40 1 98 1 tacagcagca gcagcaacag catctactac agcagcaaca gcagcagcaa catcagccac 

204 1 agetgeagea acaccaccaa ctgcaacagc ageegcatgt aagcggcgta cgtgtgaaga 

2101 ccccgagtac tccacaaacg ccacaaatgt gttcgatege ctcctcgcca teggagctgg 

2161 gcggttgcaa tagtgccaat aacaataaca alaataacaa caacagtagc ageggtaatg 

2221 ccagcggtgg cageggegtg agegteggeg ttgttgttgt gggeggacac cagcaactgg 
45 228 1 tgggaggcag catggtggga atggcgggca tgggcacgga tgcccaccag gtgggcatgt 

234 1 gtcacgaegg cttggcggga aeggcaaacg agetgacegt ctacgatgtc atcatgtgcg 

2401 tgtegcagge gcaccgcctc aactgctcct acaeggagga actgaccaga gagctcatgc 

2461 gtcgtcccgt gaeggtgeca caaaatggga ttgccagcac agtggccgag agtctggagt 

252 1 tccagaagat ctggctgtgg caacagttct eggecagggt gacgcctggc gttcagcgga 
50 2581 ttgtggagtt tgegaaaege gtacctggct tctgtgattt cacccaagat gaccagctta 

2641 tactaataaa gctgggcttc ttcgaggtct ggttgaccca tgtggcccgg ttgatcaatg 

2701 aggegacatt gacactggac gatggtgcct acctgacgcg ccagcagctt gagatactct 

2761 acgattctga ctttgtcaac gecttgetga actttgecaa cacgctgaac gcciacgggc 

2821 tgagtgacac egaaategga ctcttctcgg ccatggtgct gcngcctcg gatcgagctg 
55 288 1 gaetcagega geccaaggtg ateggcaggg ccagggaact ggtggccgag gcgctgcgcg 

2941 tacagatcct gcgttcgcgg gcaggatccc cacaggcgct gcagctgatg ccggcgctgg 

— 110 — 



Aiiorncy Dockci Number 21 1 0 1. 0053 U I 



3001 aagccaagat acccgagctg agalccttgg gggccaagca cnctcacac ctagactggc 
3061 tacggatgaa ctggaccaag ctgcgcctgc cgcccctctt cgccgagatc ttcgacatcc 
3121 cgaaggctga cgatgagctg taggatgtgg agccaacccc gcgattccag ggccgtgcaa 
3181 agcaaaccgc aacaagaaca gaatattcta ccacttgtag gcttaagcaa cgtagctata 
5 324 1 gatcgaaatg ggagggccgc agatcagata cacgtctact cagcattacc ggagagatag 
330! tccactaagc ctatatgcat actactatac tagcagtgtt a 

33. SEQ ID NO: 33 Accession No. NM_1 65465 Drosophila melanogaster 
Ecdysone receptor CG1765-PB (EcR) 

10 

MKRRWSNNGGFMRLPEESSSEVTSSSNGLVLPSGVNMSPSSLDS 
HDYCDQDLWLCGNESGSFGGSNGHGLSQQQQSVITLAMHGCSSTLPAQTTHPrNGNA 
NGNGGSTNGQYVPGATNLGALANGMLNGGFNGMQQQ1QNGHGL[NSTTPSTPTTPLHL 
QQNLGGAGGGG1GGMGILHHANGTPNGLIGVVGGGGGVGLGVGGGGVGGLGMQHTPRS 
15 DSVNSISSGRDDLSPSSSLNGYSANESCDAKJCSKKGPAPRVQEELCLVCGDRASGYHY 

NALTCEGCKGFFRRSVTKSAVYCCKJGRACEMDMYMRRKCQECRLKKCLAVGMRPECV 
VPENQCAMKJlREKKAQlCEKDKjVtTTS^ 

QHATIPLLPDEILAKCQARN1PSLTYNQLAV1YKLIWYQDGYEQPSEEDLRRIMSQPD 
ENESQTDVSFRH1TEIT1LTVQL1VEFAKGLPAFTKIPQEDQITLLKACSSEVMMLRM 

20 ARRYDHSSDSIFFANNRSYTRDSYKMAGMADNIEDLLHFCRQMFSMKVDNVEYALLTA . 
IVIFSDRPGLEKAQLVEAIQSYY1DTLRJY1LNRHCGDSMSLVFYAKLLSILTELRTL 
GNQNAEMCFSLKLKNRXLPKJLEEIWDVHA1PPSVQSHLQITQEENERLERAERN1RAS 
VGGAITAGIDCDSASTSAAAAAAQHQPQPQPQPQPSSLTQNDSQHQTQPQLQPQLPPQ 
LQGQLQPQLQPQLQTQLQPQIQPQPQLLPVSAPVPASVTAPGSLSAVSTSSEYMGGSA 

25 AIGPrTPATTSSrTAAVTASSTTSAVPMGNGVGVGVGVGGNVSMYANAQTAMALMGVA 

LHSHQEQLIGGVAVKSEHSTTA 

34. SEQ ID NO: 34 Accession No. NM_1 65465 Drosophila melanogaster 
30 Ecdysone receptor CGI 765-PB (EcR) 

1 tagtattttt ttggactttg ttgttaacgg ttgttcgctc gcacgtacga agcccgatcg 

61 cgttcgtcaa aaaacaagat acaaaataca gcacacacaa ttgaaaacga caacctaaca 

1 2 1 gtacggtttc ccaaagcacc ttacatttca aaaccgaaaa cccccaaaat gttgtaacca 
35 181 aataatgttt aaatcacata tacacctaca tatatttatg aaaaattgtt agacaaatcc 

241 caaataatac cagttccccc aacaaccgca acaaacacaa gtgcaattca tcggcaaaaa 

301 ttaatataaa gtgcaaatgc attgtagctg aaactcaaac aatagtaaaa atacatacat 

361 aagtggtgaa gaagcaaaag gaaatagttc ttaaaataac gcaaatcgag agcatatatt 

421 catatttgta cagatattat atggcggctg catagtgcaa actgcggcig agggaataca 
40 481 gcggtatcga aatgtaaata ggaaacaacg aagccagaac tcgaaatcaa acatcagcaa 

541 cgtgacacac agacataaga cgcccgtcta gtcgtggtct gtggaacgct agctccgctt 

601 tgccaggagc cggagacttt ttccgcatcc acaatattac atatgtacat atatcgaaga 

661 tagtgcgcga gtgagtgagg gatttgtgcc gtggatcccg atccccttac atatatataa 

72 1 aggtagtgaa aagattttac tcaacattcc aaatagtgct ttgtcaactg gaataccttt 
45 781 tgttcaaata cgcagtgggc ccatggatac ttgtggatta glagcagaac tggcgcacta 

841 tatcgacgca tatgctctga ttgtttcccg cactaaatga gcagggattc gggcgaaaat 

901 gtattttgaa cgcaaacaag tgcgcaaaaa atactagctc caccacgaaa ctgcacaaaa 

961 caccgccaga agcgagcaga acctcgggcc gcacgaccga gcttcgtaaa gcaacagagg 

1021 atcttaccag gagatagctc ttctccacat agaccaactg ccagggacaa gclccttgtc 
50 1 08 1 cccagccgac gctaagtgaa cggaaaacgg ccacaaaacg gcgactatcg gctgccagag 

1141 gatgaagcgg cgctggtcga acaacggcgg cttcatgcgc ctaccggagg agtcgtcctc 

1201 ggaggtcacg tcctcctcga acgggctcgt cctgccctcg ggggtgaaca tgtcgccctc 

1261 gtcgctggac tcgcacgact artgcgatca ggacctttgg ctctgcggca acgagtccgg 

1321 ttcgtttggc ggctccaacg gccatggcct aagtcagcag cagcagagcg tcatcacgct 
55 1381 ggccatgcac gggtgctcca gcacictgcc cgcgcagaca accatcattc cgatcaacgg 
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1 44 1 caacgcgaat gggaatggag gctccaccaa tggccaatat gtgccgggtg ccactaalct 
1 501 gggagcgttg gccaacggga tgcicualgg gggcttcaat ggaatgcagc aacagattca 
1561 gaatggccac ggcctcatca actccacaac gccctcaacg ccgaccaccc cgctccacct 
1621 tcagcagaac ctggggggcg cgggcggcgg cggtatcggg ggaatgggta ttcttcacca 
5 1 68 1 cgcgaatggc accccaaatg gccttatcgg agttgtggga ggcggcggcg gagtaggtct 
1 74 1 tggagtaggc ggaggcggag tgggaggcct gggaatgcag cacacacccc gaagcgattc 
1801 ggtgaattct atatcttcag gtcgcgatga tctctcgcct tcgagcagct tgaacggata 
1 86 1 ctcggcgaac gaaagctgcg atgcgaagaa gagcaagaag ggacctgcgc cacgggtgca 
1921 agaggagctg tgcctggttt gcggcgacag ggcctccggc taccactaca acgccctcac 

10 1981 ctglgagggc tgcaaggggt tctttcgacg cagcgttacg aagagcgccg tctactgctg 

204 1 caagttcggg cgcgcctgcg aaatggacat gtacatgagg cgaaagtgtc aggagtgccg 
2101 cctgaaaaag tgcctggccg tgggtatgcg gccggaatgc gtcgtcccgg agaaccaatg 
2161 tgcgatgaag cggcgcgaaa agaaggccca gaaggagaag gacaaaatga ccacttcgcc 
222 1 gagctctcag catggcggca atggcagctt ggcctctggt ggcggccaag actttgttaa 

1 5 228 1 gaaggagatt cttgacctta tgacatgcga gccgccccag catgccacta ttccgctact 
234 1 acctgatgaa atattggcca agtgtcaagc gcgcaatata ccttccttaa cgtacaatca 
2401 gttggccgtl atatacaagt taatttggta ccaggatggc tatgagcagc catctgaaga 
2461 ggatctcagg cgtataatga gtcaacccga tgagaacgag agccaaacgg acgtcagctt 
252 1 tcggcatata accgagataa ccatactcac ggtccagttg attgttgagt ttgctaaagg 

20 258 1 tctaccagcg tttacaaaga taccccagga ggaccagatc acgttactaa aggcctgctc 
2641 gtcggaggtg atgatgctgc gtatggcacg acgctatgac cacagctcgg actcaatatt 
2701 cttcgcgaat aatagatcat atacgcggga ttcttacaaa atggccggaa tggctgataa 
276 1 cattgaagac ctgctgcatt tctgccgcca aatgttctcg atgaaggtgg acaacgtcga 
2821 atacgcgctt ctcactgcca ttgtgatctt ctcggaccgg ccgggcctgg agaaggccca 

25 288 1 actagtcgaa gcgatccaga gctactacat cgacacgcta cgcatttata tactcaaccg 
2941 ccactgcggc gactcaatga gcctcgtctt ctacgcaaag ctgctctcga tcctcaccga 
3001 gctgcgtacg ctgggcaacc agaacgccga gatgtgtttc tcactaaagc tcaaaaaccg 
3061 caaactgccc aagttcctcg aggagatctg ggacgttcat gccatcccgc catcggtcca 
3121 gtcgcacctt cagattaccc aggaggagaa cgagcgtctc gagcgggctg agcgtatgcg 

30 3181 ggcatcggtt gggggcgcca ttaccgccgg cattgattgc gactctgcct ccacttcggc 

3241 ggcggcagcc gcggcccagc atcagcctca gcctcagccc cagccccaac cctcctccct 
3301 gacccagaac gattcccagc accagacaca gccgcagcta caacctcagc taccacctca 
3361 gctgcaaggt caactgcaac cccagctcca accacagctt cagacgcaac tccagccaca 
3421 gattcaacca cagccacagc tccttcccgt ctccgctccc gtgcccgcct ccgtaaccgc 

35 348 1 acctggttcc ttgtccgcgg tcagtacgag cagcgaatac atgggcggaa gtgcggccat 
3541 aggacccatc acgccggcaa ccaccagcag tatcacggct gccgttaccg ctagctccac 
3601 cacatcagcg gtaccgatgg gcaacggagt tggagtcggt gttggggtgg gcggcaacgt 
3661 cagcatgtat gcgaacgccc agacggcgat ggccttgatg ggtgtagccc tgcattcgca 
372 1 ccaagagcag cttatcgggg gagtggcggt taagtcggag cactcgacga ctgcatagca 

40 378 1 ggcgcagagt cagctccacc aacatcacca ccacaacatc gacgtcctgc tggagtagaa 
3841 agcgcagctg aacccacaca gacatagggg aaatggggaa gttctctcca gagagttcga 
3901 gccgaactaa atagtaaaaa gtgaataatt aatggacaag cgtaaaatgc agttatttag 
3961 tcttaagcct gcaaatatta cctattattc atacaaatta acatataata cagcctatta 
4021 acaattacgc taaagcttaa ttgaaaaagc ttcaacaaca attggacaaa cgcgttgagg 

45 408 1 aaccgggaga aaatttaaga aaaaaaaaac cattgaaaat tatgaaattt agtatacatt 
4141 ttttttgggt ggatgtatgt cgcatcagac tcacgatcaa ttctcgaatt ttgttaacta 
4201 aattgatcct ccaaactgca tgcgaaacag atcagaaaag agaacagaca gtagggcgtg 
4261 aacagaggga agagagaaga gaataaagat tgtttatatt taaaaaatat ataaaataat 
4321 aattactaac tctaaacgta atgaaagcaa ctgtataata tctaactata actataaatt 

50 438 1 cgtactgtag ggaagtgaga aaatctgtta aatgaaacaa aaataatgat aataacatta 
4441 tcatccacca taattaaaat catttaaagt aattaaaaac aaaacacttt taaaacacgc 
4501 aaaacttgga ctgattttat aaatattttt taatcataaa gaaaggcaac ctgaaaaaaa 
456! tattacaaaa acaaataaca acatatttta ttatgacacc cttatatgtt ttcaaaacga 
4621 gaatttaaat tcnagattc ttataatttc atccaaaaat attagccagc aaaaaccttt 

55 468 1 attattggca ttgtttttag acatgttttc aaaaaaaact ttgatattga aactaaacaa 

474 1 aggalaatga aatgaaagtg atlggagtct tactcaaaaa ccaaaaggca tcaaaaggta 
4801 ttaaattaaa aatataatct aatrtcgagt tcaagaaaca ctttttggtg gaaaatagtt 
4861 ttcaatcact ttgataaaaa ccacacaaat taataaatac atgcatacac caaaagactt 
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4921 caatatatat ttttaaaatt tacattgata attcgaaatt tgaataagaa tcacatccat 
498 1 ctaatttggc taaatcaaaa tttttatgaa agccacacaa aaaacgtgca aatttgatta 
504 1 ctttggcaat ttttatgtta*tacaaaattt atgcaattga ttttcaaaat aatttttatt 
5101 agattgtatt agincattt tgctttggga tgtacatttt aaataaattt tactttaaat 
5 5161 tgttggcctt attttaactt aaatcaaatt tattctaatt ttagtaaaaa aaaatgtgtt 

522 1 taaaattgaa aataagaaca ctgtaaaata ttaataaaaa attaaagttt aaagtgattc 
528 1 ttttattatg taaaaagaag acaaaaaata tettaegtag ctttctactt gaattgtgca 
534 1 attttttact tttactacta atcctaattt aaatataatt tacacacacg cctacacatc 
5401 cagccacata tttttaattt laagtcaacc taatttataa atatgaattt gtataatgac 
1 0 546 1 gaactaaaat tagcatgaca tcatggacat acttggaaat aactctatca aacgagctaa 
552 1 atgcattgaa gaagaaaatt cttgttaaat atagtctgea cttcgacaaa cgaaaatcag 
5581 tgaatt 

35. SEQ ID NO: 35 Accession No. NM_1 65364 Drosophila melanogaster 
15 Hormone receptor-like in 39 CG8676-PD Hr39) 

MPNMSSIKAEQQSGPLGGSSGYQVPVNMCTTTVANTTTTLGSSA 

GGATGSRHNVSVO^IKCELDELPSPNGNMVPVIANYVHGSLRIPLSGHSNHRESDSEE 

ELASIENLKVRRRTAADKNGPRPMSWEGELSDTEVNGGEELMEMEPT1KSEVVPAVAP 

20 PQPVCALQP1KTELENIAGEMQIQEKCYPQSNTQHHAATKLKVAPTQSDPINL1CFEPP 
LGDNSPLLAARSKSSSGGHLPLPTNPSPDSAIHSVYTHSSPSQSPLTSRHAPYTPSLS 
RNNSDASHSSCYSYSSEFSPTHSPIQARHAPPAGTLYGNHHGIYRQMKVEASSTVPSS 
GQEAQNLSMDSASSNLDTVGLGSSHPASPAGISRQQLINSPCPICGDKJSGFHYG1FS 
CESCKGFFKJ^TVQNRKNYVCVRGGPCQVSISTRKJCCPACRFEICCLQKGMKLEAIREDR 

25 TRGGRSTYQCSYTLPNSMLSPLLSPDQAAAAAAAAAVASQQQPHQRLHQLNGFGGVPI 
PCSTSLPASPSLAGTSVKSEEMAETGKQSLRTGSVPPLLQEIMDVEHLWQYTDAELAR 
INQPLSAFASGSSSSSSSSGTSSGAHAQLTNPLLASAGLSSNGENANPDLIAHLGNVA 
DHRLYTCIVKWCKSLPLFKNISIDDQICLLINSWCELLLFSCCFRSrDTPGEIKMSQGR 
KITLSQAKSNGLQTCIERMLNLTDHLRRLRVDRYEYVAMKVIVLLQSDTTELQEAVKV 

30 RECQEKALQSLQAYTLAHYPDTPSKFGELLLRJPDLQRTCQLG1CEMLT1KTRDGADFN 

LLMELLRGEH 

36. SEQ ID NO: 36 Accession No. NM_1 65364 Drosophila melanogaster 
35 Hormone receptor-like in 39 CG8676-PDHr39) 

I actaacaaaa caaacatttt getacttegt cgcaggcggg actgtgttgc gtcgtgtgat 

6 1 cgctagagcg gttgtggaat eggattcgag cgcaaaacac cgttcatgct gtgagcgaaa 

1 2 1 aagagtggta gcgcctacag tggcatatgt agttaaatcc gtgaataagt gaaaaatccg 
40 181 atatttgtcg igcaataatt tcctcgattg gcatcaagtg gcttccagtc gggtacatat 

241 tgcacaagaa atgttatacg cataatgtgc aegcaaatta aacgaattct ctatgaaaat 

301 gtgactagaa tgtgagtcga acaaaacgag taaaacgtga aatcccaact ggcttttggg 

361 taacaaatct tatcaacaca geaaeggaaa tacattaaaa tcttgataga ctgagaaagg 
• 421 gacaattgga atacttttag ttatttttaa atgttttaca acacaatgga actgeatcaa 
45 48 1 cgacacctct caaactttta caaattgeae aactgagaaa tagtctttga taaataaata 

54 1 aaatataaga aategctact gaaacaagat gecaaacatg tccagcatca aageggagea 

60 1 geaaageggt cctcttggag gaagtagegg ctatcaagta ccggtcaaca tgtgcaccac 

66 1 cacagtcgcg aatacgacga ccactttggg aagctccgcc gggggageca ctggctcccg 

721 gcacaacgtc tccgtgacaa acatcaagtg cgaactagac gaactaccgt caccgaacgg 
50 78 1 caacatggtg ccggttatcg caaactacgt teaeggtage ttgegcatte cactcagtgg 

84 1 acattcaaat caiagggagt ccgattcgga ggaggagctg gcaagtattg agaacttgaa 

901 ggttcggcga aggaeggegg eggacaaaaa tggtcctcgt ccaatgtcct gggagggega 

961 getgagegat actgaggtca aegggggega agagctgatg gaaatggagc caacaattaa 

1021 gagtgaggtg gtccctgctg ttgcaccccc acaacccgtc tgcgcactac aaccgataaa 
55 1 08 1 aacagagcta gagaacattg caggegagat gcagattcaa gagaagtgtt acccccagtc 
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1141 caacacacaa catcacgctg ccacaaaatt aaaagtggcc ccgacgcaaa gtgatccgat 
1 20 1 caatctcaag ttcgaaccgc ctctgggaga caattctccg ctactggctg caegtagcaa 
1 261 gtccagcagt ggaggccacc taccactgcc aacgaatccc agtcccgact ccgccataca 
1321 ttccgtctac acgcacagct ccccctcgca gicgcctctg acgtcgcgcc acgcccccta 
5 1381 cactccgtct ctgagccgca acaacagega cgcctcgcac agtagctget acagctatag 

1 44 1 ctccgaattc agtcccacac actcgcccat teaagegegt catgccccac ccgccggcac 
1 501 gctctatggc aaccaccatg gtatttaccg ccagatgaag gtggaagcct catccactgt 
1561 gccgtccagt gggcaggagg cgcagaacct gagtatggac tctgcctcta gcaatctgga 
1 62 1 tacagtgggc ttaggatctt cgcaccccgc atctccggcg ggcatatcac gtcagcagtt 

10 1 68 1 gatcaactcg ccctgcccca tctgcggtga caagatcagc ggatttcatt aegggatttt 
1 74 1 ctcctgcgag tettgeaagg gcttcttcaa gcgcaccgtg caaaatcgea agaactacgt 
1 801 gtgcgtgcgt ggtggaccat gtcaggtcag catttccacg cgcaagaaat gtccagcctg 
1 861 ccgcttcgag aagtgtctgc agaagggaat gaaactagaa gcgattcggg aggaccgaac 
1 92 1 ccgtggcggc cgctccacat accagtgctc ctacacgctg cccaactcaa tgettagtec 

15 1 98 1 getgettagt cctgatcaag cggcagcagc tgccgccgca gcagcagtgg caagtcagca 

2041 gcagccgcac cagcgactac atcaactaaa tggatttgga ggtgtaccca ttccctgctc 
2101 tacttctctt ccagccagcc ctagtttggc aggaacttcg gtcaagtegg aagagatggc 
2161 ggagaeggge aagcaaagee tecgaaeggg aagegtacca ccactactgc aggaaatcat 
222 1 ggatgtagag catctglggc agtacaccga tgcagagctg gcccgcatca accaaccact 

20 228 1 gtccgcattc gcctctggca gctcncgtc gtegtcateg tcaggtacat cctcaggcgc 

234 1 ccatgcacaa ctcaccaatc cactactggc tagtgctggt clctcgtcca atggcgagaa 
2401 tgccaatcct gatcttatcg ctcatctctg caacgtggct gatcaccgtc tttataaaat 
2461 cgtcaaatgg tgeaagaget tgccgctttt taagaacatt tcgatcgatg accaaatctg 
2521 ettgetcatt aactcgtggt gcgagctgtt gctcttctcc tgctgtttta gatcaattga 

25 258 1 tactcctgga gagattaaaa tgtcacaagg caggaagata accctatcgc aggecaaate 
2641 aaatggcttg cagacttgea ttgaaeggat gctcaaccta acagatcacc tgaggegatt 
2701 gcgcgttgat cgctacgaat atgttgccat gaaagttatt gtgctgttgc agtcagatac 
2761 gacagagtta caggaagegg taaaggtgcg cgagtgtcag gaaaaagctt tgeagagett 
2821 gcaagcttac accctggcgc attatcctga cacgccatcc aagtttgggg agettttget 

30 2881 acgcattcct gatttgeage gaacgtgcca gcttggcaag gagatgttga cgatcaagac 
2941 tcgcgatgga gctgatttca atttgetaat ggagcttttg cgeggagage attgacaatt 
3001 gataactaag aeggaaatet tttaccattg gcaaaacaag tttcacatat ttagtattag 
3061 atatatatat tctatagata agatccttac tgtaagttct gaaaacatgt gectaaaaac 
3121 caaagccacg atagcagtca catcaggccc actggtcgag attaaatcca agagcaagat 

35 3181 tgccaaattt ttacaccaat atatattttg atatgageca tgtgcagggc ctcagatcgc 
3241 tgttgttgtc ggctaaagtt tcagtaagaa aagtatatat tgattttget atttatacat 
3301 atttgactta tgtatagtgt aaactaaagc acacatggaa aatgaaaaga ctaaacaaat 
3361 ttatttaaag attactttta ctattataga aaaaggggaa aaataaaaaa cacaaaggca 
3421 gagaagaaaa tttagttaca acaggtagcg acatttttat attttcttat ataaggaaat 

40 348 1 attcaatgta ttttaaatat aaagecaaac ccgatttggt ttgggaaaga gctactgaaa 
3541 tttttgatat ctatatattc atcactagaa gacgaatgaa tgtatccaat gtttaaatgt 
3601 tgtagegm agttttagtg caatttcaca catgtctaca tacatgaata ttcagegaga 
3661 tatgtttgca aactattata aagcaaaaga ccactcgaaa tcgccatcac tgggttggct 
3721 aagactattc cagttatget gtttgttgca taaaaaacca caactaegta catcaataaa 

45 378 1 atgtataatt ttttattgga gttttagatt tgtattaact tcttccttat aattacgatt 

3841 attattatta ttactaattt tatgaatatt gtgtaacact gacttaaata gctgaaaaaa 
3901 tcctgcaaca ggatttaaaa cacctgaata cacaaaacat tataacatga atacattttg 
3961 ettatggect agatagtttg atatgtactt tgcatatgta tgcatgtgtc tatatgtgag 
402 1 tacgtaccat acaaattcct gtcccaccag aaaaatcaca cgcaataaaa aattccaaaa 

50 4081 tactaagctc gtatctacaa agaaagatta aaagacaaat tgatgaatag gaatatgttg 
4141 ccggaagtcc aagagatttg gctgaaagta tcgacaaatt ttcaacacat cgttcatgga 
4201 tattgtgcta acactctcag tttgaaaatc attttctgtt aaactttcta tataataagt 
4261 tctccattcg attttgtatt tacaatttgt ttctttaatt ttcctttatc agttgtatct 
4321 atgaaacatgaggatctcagttcatattga tcgtgttctt ctgccgtaca ccgcttctgt 

55 438 1 ccgttaatgt aaaccataag tataaatgaa attagttaaa tgtttattta taaataaagc 

4441 gctataataa atttcaatac atttatcata gttaactgat taagaccact gaaatcaaaa 
4501 atattttatt tactaagcaa agcacacgca aacaatttat aatgtnatt acgttaacaa 
4561 caaactcatt tttaataatt ctttatgaat acacaaagtt aegcaatttt ccctctaggc 
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462 1 geattgetta aatagttaaa gaaaaataat aaacccatag cgcaatattt aatgtaaaac 
46S I agtlttcctt gcgtgtgatg tttgetctag etaegtacaa attcatcatt lattaaattt 
474 1 aaaactcaat tttgetttta aataaattta ataagtaaaa ucaacaata attgatatac 
480 1 aattgtcaat gcaatatttt gtaataaaaa tgcgaaaaat c . 

5 

37. >SEQ ID NO:37 - 96_/« j:x4_7.55_kb+o!igosJVIap.seq 

GGGCCCCCCCTCGAGGTCGACGGTATCGATAAGCTTGCCGGTGGOSGAGAAGGTC^ 
CGATGAGAAGCAGCCACAGCCACATGACCACGGTGCGTCCGTACACAAGATGGCACTG 
TTGAGGAACTCCTCCAGTTCGTCCCCCACATGACCGC^ 
1 0 AACCAGGTGGACAGGTTCCGCTTCAATCGCGCCTCTCTCATC^ 

CATTGCCATGCACGACGTAGACTTGCTGCCCTTGAATGACAATCTGCTCT 

ACATCGCCGGACCGAAGCTACATCCCAAATACC^CrATGATAACTTCGTTGGAGGAATATTACTGG 
. TTTAAGCAGATGAACGGCATGTCGAACCAGTACTGGGGCTGGGGATT 
TGCAGGACTGCAGGTGACGCGGCCGCAGAACATTAAGACTGGCACTAATGATAC^ 

1 5 TTCGCAACTAGACGTGACTACTAATAATTATTGTCATTCA^ 

CACCCAGAAGTGCTTCAACCAGAAGGAGATGACCCGCAAGCGGGACCACAAGACGGG^ 
TACTTAAGGTGCATGAGATGCTCATTGACCAGGTGCCGGTGACCATCCTCAACATTTTGCT 
ACGCCTTGGTGCGACTGCTCCGGAACGGCAGCGGCTGCATCGGCGGTACAAACCTGATGGGTTC 
CTATGTTTATTTCGCTATTATAGTGTGTTGTATTGTATAAATGCGCTAATACACGTGCACCATGCCATAGAG 

20 AGAAGAGCACGTAGGTGCAAAGGCCGCCCATGAACTGATTGGTGAGCAGATTTCTGCGG 

TGGGTGCCCGATTTCACGAGCACCAGAATCCAGAGCACGAACACGGACAGGAAGTAGAAA^ 
CAGGCCCAAAATACCTGCGAATTGGTGGGACATTAACTAAGTTGGTT 

GCCCGAGATGGCAGCCATCGATGTGCGACAGTATTCCACGGCGGATATGTTGTTCCGGATGGCGCCCTCGCTGTAGGCGA 
TTATTTCGCCAGTCTTGGACTGCGTGGTCTTCACTCGATTCATTTTATTTAATTAAATTCT 
25 ATATTCCTAGGCTGTGAACTTCGATTGTGTGCCGATTGTGTTATCGATTGGTGCCGATAACTAT 
CTAGCGGTTTTTGCAGGATAAATAGTTTTTGTAAATTTTCCGAGATAAACTTGACGAGCTC 
GTTTAATACAATATCAAATATATTTGCTGAAGTGTATATTTATTCTCACCGCTCTGTC 
TTGCCATTCGCCCGGGCArcTAGATTGTTGTTATTGGGATTG^ 

TGTGGTGATAACGCGAGCTGCCGAATCTGCGTGCAATTCGTGCGTTTGACGTGGGTACTAACTGCT 
30 ACAGTTGTTCTGATACGCAGAGTTCCTGCCTCACCACACACGACCACCTCCAT 
CCTCCACCGACAGCAGCTGCTCCACCGCACCACCAGGAGAGGGGCAA 
TGTAACCGCCGAAGAACTGCGCGGTGTGCGGGGACAAGGCTCTGGGCTACAACT^ 
AAGGCGTTCTTCCGACGGAACGCGCTGGCCAAGAAGCAGTTCACCTC 

CACTCGACGCTTCTGCCAGAAATGCCGCCTGCGCAAGTGCCTGGATATCGGGATGAAG^ CATTATGT CCGAGG 

35 AGGACAAG CTGAT CAAGCGGCGCAAGATCGAGACCAACCGGG CCAAG CGA CGC CTCATGGAGAA CGGCA CGGATGCGTGC 
GACGCCGATGGCGGCGAGGAAAGGGATCACAAAGCGCCGGCGGATAGC^ 
AC^GGACTCGCAGAGCTGCGGCTCGGCGGACAGCGGGGCCAATGGGTGCTC 

AGGTCAATCCGCTTC^GATGACGGCCGAGAAGATAGTCGACCAGATCGTATCCGACCCGGATCGAGCCTCGCAGGCCATC 

AACCGGTTGATGCGCACGCAGAAAGAGGCTATATCGGTGATGGAGAAGGTAATC^ 
40 GGTGTCGCATTTGATCGACTATCCAGGTGGGTGCAGACAAGATTTCATCGTTT 

TTGAATCTTTACAGGCGACGCACTCAAGATCATTTCAAAGTTTATGAACTCGCCCTTTAACGCGCT 

TTAAAATTTGTGGTTTTAAACTTAATTTCA CATTCCTTGTTAATTTAAATACG CAGTATTCACCAAATTCATGAGCTCAC 

CCACXX^CGGCGTTGAAATTATCTCAAAGATAGTTGATTCGCCCGCGG 

TCGCCAGAGGACGCCATCGATATAATGAACAAGTTCATGAATACCCCAGCGGAGGC 
45 AAGCGGCGGAGGAGCGAACXSCAGCCCAGCAGACAGCAGACCGCAAGCC^ 

CAGCGCCAGCGGAGCGAGCTGATACTGTCATTCAAAGCATGCTGGGCAAC^GTCCGCCAATTT 

GTGGATCTGCAGTACCACTCGCCCGGTGTCGGGGAGCAGCCCAGTACATCGAGTAGCCACCCCTTGCCTTACATAGCCAA 
CTCGCCGGACTTCGATCTGAAGACCTTCATGCAGACCAACTACAACGA 
ACTCAATCGAATCGGTGCTATCCGAGGTGATCCGCATTGAGTACCAGGCCTTC^ 
50 GTAAAGGAGGAGATGTCCTACGGCACTCAGTCTACGTACGGTGGATGCAATTCGGCTGCAAACAATAGCC^ 
GCAGCAACCCATCTGCGCCCCATCCACCCAGCAGTTGGATCGCGAGCTA^ 
TGCGACTGGCCAGCGAGGCTCTTTATGATCCCGTGGACGAGGACCTCAGC^ 

GTAACCCGCTAGGGATAACAGGGTAATAACAGTCCACGGTATTAGCCTATAGGTCTTT CTACA TTTATAGCTCCAACACC 

ACGGCTTATCTAATCAGAGTGTGCGAGCTGCX1ATATATGTACACACGGCACCTGGCA 
55 GTGCGTCTCTCGATGTTGGCCCACGGGCCGTATCTTCGTCAGCCAGTTTCTGGGTTCCCAGCAATGCTCGCCTA 

GTAAACACIACTTTTTAATGGGGTGGCTCAAAGTTTTTGATTTCCCAAGAGCrTTGGTCGAG 

CCAGATAAGCTATTTTCCCCCAGAGGGTTAAAGAATTTGAAGTCATGCGACTGGGTCTAGTTAAG^ 

AATTGGCCTTTAATTAAGACCCTAAACGTGACAAACTTCCATTCTATA 

TATTTTCGGAACAAATCGGGCACTCATTTATATCTTTAGCTTC 
60 ACCTCTAGTTCCCGTGGAGAAATGATTTGGAATTATCCAATAATGATTCATCACTTCCA 

GAGCCACCCTAGCTTTCATGCAATCAGAACGTCTGGTCTGCCAAGAAGGAGCAGACAGCGGCTTT 

TGCCAATTGTGACACTATCAACCATTATCAAGAGTACCAGCAGGCGCTCATGAGTCTCCAGCCAGCCGTC 

TTTATTGTCGCTTAGACTGTTTACCGATTTTGCCTCATC^ 

ACAATTTTGACGAAATATATGGAG CAGCCAGGTGTTGGGCGCTATGATAAGCAGTGCTCCGCCATT CGATTGAGTCACCT 
65 TCCAGGGAGAAGCCTTTACGATTATGGCGATAATAATGGCCACCAAAGAGAACATGGGCAACAT^ 
AAGTTIX^CGAAGGCAATATCTACGAGGAGCACCAAAAGTTCATCACAACGTTTGACGA 
CATAATCCTGATGATGTGTGCCATTGTCCTTTTTACCTCG^ 

AACAGGTGAGTAAGGACTTGATACCATACTGCAGTATTACTAACTTTCTTTCATTCGATAGA^ 
TGCGAAGATATCTGGAGAGTGTTTATTCTGGCTGTGAGGCGAGAAACGCGTTTATCAAG 
70 GTGGAGCGTCTGAACAAGTTCATAATTAATGTCTATTTGAATGTTAACCCATCCC^ 
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ATTCGATTTGAAAAATCACTAGACAACCGATG 
ACAAGCTGTAGTTGTTGTTGTTGTTGATGTCTGTTTTAT^ 
TAGATTTGCATATACTGCATAGATTTTATATTTCTACATCAAAG^ 
ATCTATATGTAATGTACACCGTTTACCTAGTTTCAA^ 
5 CTGTGCAAAAAGGAAAAAAGACAAAAAAAATAAACTGACTTT^ 
ACTCGAATGAATTTCGAACCCTCCAGGACAAATTACGCAA^ 
GAA^GTCACAAAATAAAAATTCCAAGTAGGAAAAAACAAATAAGATGTT 
GCATCAACCCGGGGAAACACCACAGCAACCGCGCATGTGTACCCGCGACCAGTCCT 

CCGCAGCCAGCGT ATG TGTCCGCATCTGCCGACCCCGTCTTACATAGTC^TTTATGTATAATGTAGGTAATA TAATAGCT 
10 CGAGCTCGCTCCGCACCACCAATGTGCGTCGTGCAAGTCCATTCCAATTGTTATCCGGTC^ 

TTTCAGGTTGATTCGCGGCAATCCTTGGCCCATTGCAGAAACTCATCCAACGCGCTGA 

CTTCACACGCAGATTACGATCGGTTGTAATGAGCACCAATTCC^ 

AGTTAGAAATGCTATCAGGAAGGACATTTAACGGAAGCAGCTCACCTGT^ 

GCCATTGCCGTTGCCAGAATCTTGTCGTC^TTGGACAAATATTCCTCCTCAACTAGGG 
1 5 CGAGCCCTTTGTGGTAGCACATCTAGAAAAGAAATCAATAAGGTAT^^ 

ATTACTAClXiATTTAAAAGTAAAATTTCAATACAITATCAGGAAACTTTTATCTATCT 

CAGAATTATAAATAGCTAATCGCTAGTAAACCCTTTATCAGATATCAGTAATAAAGGAACT 

ATAATTAACAATAGCITACTTCACATTGCCTTT^ 

CGTCAAAGTGGTGGATGCGCTGCGTCTGCTTCGAACTC 

2 0 TCCTTAACCACTGTCAGTGGTATAATAAGTGTGTAGCGTTTAA^ 

GATAAAGC^GTTGGTATCCGGTAGAAGATAGCGCGGTCGCACCTa^TGTATATTTTCGTGTCCACGAA 

CCTCCAACTTGCTGGTGCATATCTGCTTAACCCTAGC 

TCGGAATCAGTTGAACGACACAACAATTCCTCAACGGCTC 

TTCAGCATCATAGTTTTCCAATGCGTTCTTCATCGCfTGCGTTTAGGTGCTCGAC^ 
25 GGAGAGCCTCTTGATGCTGGACGTAGTATTCCTGGAACTGACC^ 

GATCCTCTTTTGAGAT AGTCCATG CT CAG CGTTT CACGACCCAACX3GAGTGAAACCTCTAAGGGCCACATCCTCATC CAA 
CAGTATTTCGTTTTTCTCAAGTTTGTGCTTCCCCATTAAGCATTCAATGTACT 

AATCGATGAAAGAGTGTCTGCAATATTGGAGTTAATGAAAAACTAATAAAAGG CATTCAATTTATACATACT CTTCCGAT 
CTGACGGGTTCCCACACGTCCAAACTGATGCTCAACCAACGTACATAAACATTC7VC 

3 0 CGCAAAGCTGAAATAAAAGATTAATTAGCAATAATAAATAAACAAG^ 

TGGACATTTGTAGAATCCTCTAGATCTACTAGTCC 

38. SEQ ID NO:38 >GAL4-DHR96_DNA_ 



GAAGCAAGCCTCtaGAAAGATGAAGCTACTGTCTTCTATCGAACAAGCATGCGATATTTGCCGACT 
35 TAAAAAGCTCAAGTTCGcgatggcggcgaggaaaggg^ 

aaagaggctatatcggtgatggagaaggtoatcagctcacaaaaggacgccttaacagaggacgccatcgatataatgaacaagttcatgaataccccagctcgc 

ccggjgtcggggagcagcccagtacattctacgtacggtggatg^ 

atcatgtgtgccattgtcctttaatgjctamgaatgta^^ 

caaagcaacacaatctataagacgataatgcaataactaacttggaagcgtgggttctgtgcaaacc 
40 39. SEQ ID NO:39 >pET24c_Bam+XtaoJilled+DHR96 

TGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGC 

GTGACCGCTACACTTGTTAGGGTGATGGTTCITAATACAACCTATTAATTTCCCC^ 

AAGGTTATCAAGTGAGAAATCACCATGAGTGACGACTAACCGGCGCAGGAACACTGCCAGCGCA 

45 TCAACAATATTTTCACCTGAATCAGGATATGCTTCCCATACAATCGATAGATTGTCGCACCTGATT 
GCCCGACAGATCITCTrGAGATCCTTTTTTTCrGCGCGTTGGCGATAAGTC 
GCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATT^ 
CGCAGGG AGCTGCATGTGTCAG AGGTTTTC ACCGTC ATCACCG A A ACGCGCG AGGC AG CTGCGG 
CGATGAAACGAGAGAGGATGCTCACGATACGGGTTACTGATGATGAAACGGAAACCGAAGACCA 

50 TTCATGTTGTTGCTCAGAAGATTCCGAATACCGCAAGCGCTCACTGTCTTCGGTATCGTCGTATCC 
CACTACCGAGATATCCGCACCAACGCGCAGCCCGGACTCGGTAATGGCGCGCATTGCCGAGACA 
GAACTTAATGGGCCCGCTAACAGCGCGATTTGCTGGTGACCCAATGCGACCAGATCGCTTTACAG 
GCTTCGACGCCGCTTCGTTCTACCATCGACACCACCACGCTTCACCACGCGGGAAACGGTCTGAT 
AAGAGACACCGGAAGGAGATGGCGCCCAACAGTCCCTCTAGAAATAAAACCTTGACCACTACTC 

55 GGGGTCACAGGACTCGCAGAGCTGCGGCTCGGCGGACAGCGGGGCCAATGGGTGCTCCGGCACC 
TTAAGGCTGGTGTCGCATTTGATCGACTATCCAGGCGACGCACTCAAGATCATTTCAAAGTTTAG 
CTGCGCATTCTTAACCGAATCCTAAGCGGCGGAGGAGCGAACGCAGCCCAGCTACATAGCCAAC 
TCGCCGGACTTCGATCTGAAGACCTTCAAGCAACCCATCTGCGCCCCATCCACCCAGCATTCCGT 
GACAAACTATATCCGGAT 

60 40. SEQ ID NO:40 F96Xma 

5'-GAG AG ATGTGCTTCGTTA A AGCATCA ACCC 
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4.1. SEQ ID NO:41 R96SpeBgl 

5-GG ACTA GTAG ATCTAG AGGATTCTACA A ATGTCCAGTGTCTCCC 
42. SEQ ID NO:42 R96Int3 

S'-CCATTATTATCGCCATAATCGTAAAGG 
5 43. SEQ ID NO:43 R96EX3SCE 

S'-ATTACCCTGTTATCCCTAGCGGGTTACCTTAATGCGATCATCGCCC 

44. SEQ ID NO:44 R96endhind 

5-GGAAAGCTTTTCCTGCTGATCAATAATACC 

45. SEQ ID NO:45 FAPA96 

10 5*-TGGGCCCATCACTTGCTTGTAACCGCCGAAGAACTGCGCGG 

46. SEQ ID NO:46 F96INT3SCE 

5* CGCTAGGGATAACAGGGTAATAACAGTCCACGGTATTAGCCTATAGG 

47. SEQ ID NO:47 F96EX5Int3 

5* CGATTATGGCGATAATAATGGCCAAAGAGAACATGGGCAACATACGC 
15 48. SEQ ID NO:48 FGALXB 

S'-GAAGCAAGCCTCTAGAAAGATGAAGC 

49. SEQ ID NO:49 RGAL96 

S-CGTGCCGTTCTCCATCGATACAGTCAACTGTCTTTGACC 

50. SEQ ID NO:50 R96/936 

20 5 '-GCCTGG ATAGTCG ATCA A ATGCG 

51 . SEQ ID NO:51 F96BEG 

5-ATGGAGAACGGCACGGATGC 

52. SEQ ID NO:52 F96XBAi 

5'-TACATTCTAGAGACCAACTACAACGACGAGCCCAGTCTGG 
25 53. SEQ ID NO:53 R96BspEl 

5'-CATTCATCCGGACATTAATTATGAACTTGTTCAGACGCTCC 

54. SEQ ED NO:54 R96BspE2 

5-GGGCATCAACTCCGGAATTAAATGCCCGACACGCATCGG 

55. SEQ ID NO:55 RPAXCRE-AN 

30 5-GTCTCACGACGTTrrGAACCCAGAAATCGAGCTCGCCCGGGG 

56. SEQ ID NO:56 RPAXCRECO 

5-CACGAATTCCAAACTGTCTCACGACGTTTTGAACCC 

57. SEQ ID NO:57 FPAXFSE-AN 

S'-GAGAGCTAGCATGCCGGCTAGATCTCGAGATCGGCCGGCCTAGG 
35 58. SEQ ID NO:58 FPAXPOLY 

5'GAACTGCAGCTCGAGAGCTAGCATGCCGGC 

59. SEQ ID NO:59 F96ANhe 
S'-GGAGATATACATATGGCTAGCATGACTGGTGG 

60. SEQ ID NO:60 R96AHind 

40 y-TGCTCGAAGCTTCGCAGAAGATAATAGTAGG 
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V. CLAIMS 

What is claimed is: 

1 . A composition comprising an inhibitor of DHR96 activity. 

2. A composition comprising an inhibitor of DHR96 activity and a pesticide. 

3. An insect comprising a gene, wherein the gene comprises a mutation of the 
DHR96 gene. 

4. A method of enhancing the effect a pesticide has on an insect comprising 
administering t o the insect an inhibitor of DHR96 activity. 

5. A method of identifying a compound which will enhance the activity of a. 
pesticide comprising incubating the compound with an insect, and assaying if the 
compound inhibits DHR96 activity. 
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VI. ABSTRACT OF THE DISCLOSURE 
342. Disclosed are compositions and methods for modulating DHR96 activity and 
identifying molecules that modulate DHR96 activity. 
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^ ^ P 1 

Usp CSICG0RASGKHYGVYSCEGCKGFFKRTVRKOLTYA-CRENR-- NCIIDKRQRNRCQYCRYQKCLTCGMKREAVQEERQ 

Svp CVVCGDKSSGKHYGQFTCEGCKSFFKR5VRRNLTY5-CRGSR--NCPIDQHHRNQCQYCRLKKCLKMGMRREAVQRGRV 

dHNF4 CAICGDRATGKHYGASSCDGCKGFFRRSVRKNHQYT-CRFAR-- NCVVDKDKRNQCRYCRLRKCFKAGM 

0KR3 CKVCGDKSSGVHYGVITCEGCKGFFRRSQSSVVNYQ-CPRNK--QCVVDRVNRNRCQYCRLQKCLKLGM 

0HR39 CPVCGDKVSGYHYGLLTCESCKGFFKRTVQNKKVYT-CVAER--SCHIDKTQRKRCPYCRFQKCLEVGM 

E75 CRVCGDKASGFHYGVHSCEGCKGFFRRSIQQKIQYRPCTKNQ— QCSILRINRNRCQYCRLKKCIAVGM 

E78 CKVCGDKASGYHYGVTSCEGCKGFFRRSIQKQIEYR-CLROG — KCIVIRLNRNRCQYCRFKKCLSAGM 

EcR CLVCGDRASGYHYNALTCEGCKGFFRRSVTKSAVYC-CKFGR— ACEMDMYMRRKCQECRLKKCLAVGM 

FTZ-P1 CPVCGDKVSGYHYGLLTCESCKGFFKRTVQNKKVYT-CVAER — SCHIDKTQRKRCPYCRFQKCLEVGM 

Til CKVCRDHSSGKHYGIYACDGCAGFFKRSIRRSRQYV-CICSQKQGLCVVOICTHRNQCRACRLRKCFEVGM 

OHR38 CAVCGDTAACQHYGVRTCEGCKGFFKRTVQKGSKYV-CLAOK—NCPVDKRRRNRCQFCRFQKCLWGM 

DHR78 CLVCGDRASCRHYGAISCEGCKGFFKRSIRKQLGYQ-CRGAM — ^^^ v ^^ H ^!J^53C5nI'S^^nTrl3 

0HR96 CAVCGDKALGYNFNAVTCESCKAFFRR NALAKKQFT-C PFNQ-NCDITVVTRRFCQKCRLRKCLDIGM 

unique region 
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